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Preface 



In the first part of the present volume of LNCS, the reader will find the invited 
talks given at the MCU 2001 conference. In the second part, he/she will find 
the contributions that were presented at the conference after selection. In both 
cases, papers are arranged in the alphabetical order of the authors. 

MCU 2001 is the third conference in theoretical computer science. Machines, 
computations and universality, formerly. Machines et calculs universels. Both 
previous conferences, MCU’95 and MCU’98, were organized by Maurice Mar- 
genstern in Paris and in Metz (France), respectively. 

From the very beginning, MCU conferences have been an international scien- 
tific event. For the third conference, in order to stress that aspect, it was decided 
to hold it outside France. Moldova was chosen thanks to the close cooperation 
between the present chairmen of MCU 2001. 

MCU 2001 also aims at high scientific standards. We hope that the present 
volume will convince the reader that the tradition of previous conferences have 
been upheld by this one. Cellular automata and molecular computing are well 
represented in this volume. And this is also the case for quantum computing, for- 
mal languages, and the theory of automata. MCU 2001 does not fail its tradition 
of providing our community with important results on Turing machines. 

We take this opportunity to thank the referees of the submitted papers for 
their very efficient work. The members of the program committee gave us de- 
cisive help on this occasion. Thanks to them, namely, Erzsebet Csuhaj-Varjii, 
Gabriel Ciobanu, Serge Grigorieff, Manfred Kudlek, Yuri Matiyasevich, Liud- 
mila Pavlotskaya, Arto Salomaa, and Mephodii Ratsa, we can offer the reader 
this volume of LNCS. 

The local organizing committee includes E.Boian, C.Ciubotaru, S.Cojocaru, 
A.Colesnicov, V. Demidova, G.Magariu, N.Magariu, L. Malahova, T.Tofan,T.Ver- 
lan, and S.Verlan. We would like to thank all these people for their work, espe- 
cially Serghei Verlan for the preparation of this volume. We express our gratitude 
to all colleagues from the Institute of Mathematics and Computer Science of the 
Academy of Sciences of Moldova who have supported us in organizing this event. 
We have a special debt towards the director of the institute, Constantin Gaindric. 

Special thanks are due to the town of Chi§inau for its assistance, in particular 
to its mayor, Serafim Urecheanu. 

MCU 2001 could not have been held without the help of significant support- 
ers. We therefore thank the Lahoratoire d’Informatique Theorique et Appliquee, 
LITA, the University of Metz, the Pole Universitaire Europeen de Nancy-Metz, 
and especially INTAS project 97-1259, namely, its western partners, the Univer- 
sity of Metz and the University of Hamburg. 
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Maurice Margenstern 
Yurii Rogozhin 
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Three Small Universal Turing Machines 



Claudio Baiocchi 

Dipartimento di Matematica, Universita “La Sapienza” di Roma (Italy) 
baiocchi@mat . uniromal . it 



Abstract. We are interested by “small” Universal Turing Machines (in 
short: UTMs), in the framework of 2, 3 or 4 tape-symbols. In particular: 

— 2 tape-symbols. Apart from the old 24-states machine constructed 
by Rogozhin in 1982, we know two recent examples requiring 22 
states, one due to Rogozhin and one to the author. 

— 3 tape-symbols. The best example we know, due to Rogozhin, re- 
quires 10 states. It uses a strategy quite hard to follow, in particular 
because even-length productions require a different treatment with 
respect to odd-length ones. 

— 4 tape-symbols. The best known machines require 7 states. Among 
them, the Rogozhin’s one require only 26 commands; the Robinson’s 
one, though requiring 27 commands, fournishes an easier way to 
recover the output when the TM halts. In particular, Robinson asked 
for a 7 X 4 UTM with only 26 commands and an easy treatment of 
the output. 

Here we will firstly construct a 7 x 4 UTM with an easy recover of the 
output which requires only 25 commands; then we will simulate such a 
machine by a (simple) 10 x 3 UTM and by a 19 x 2 UTM. 



1 Tag Systems 

Let us recall some notions needed later on. We are given an integer d > 2 (the 
deletion number)] a “d-tag system” is nothing but a word-processor, acting on 
words W whose letters are the elements of a finite alphabet A. For each letter 
a € A, a, (possibly empty) word p{a) is given, that is called the “production” of a; 
the letters whose production is empty are called halting letters. Words of length 
strictly less than d and words starting with a halting-letter are called halting- 
words. The action of the word-processor on the generic word W is specified by 
the following rules: 

— if LF is a halting-word, the word-processor stops; 

— if not, W has the form W = aUV where: a is a non-halting letter; U is a 
word of length d—1; V is another (possibly empty) word. The original word 
W = aUV is then replaced with W = Vp{a); and the work continues on the 
new W thus obtained. 

For our purposes we can restrict ourselves to the case of deletion number d = 2; 
furthermore we can assume that all but one the halting letters are “virtual”, 

M. Margenstern and Y. Rogozhin (Eds.): MCU 2001, LNCS 2055, pp. 1-|1^ 2001. 
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because (due to the form of the initial word W) they will never appear as initial 
letters; and that the length of the word W to be processed will always remain 
strictly greater than 1 (in the following, when speaking of “tag systems” , we will 
refer to such restricted case). 

Following the general approach suggested by Minsky (see e.g. |Mml) we will 
now construct some TMs that simulate tag-systems; they will be universal be- 
cause, in turn, TMs can be simulated by tag-system^. 

2 Simulation of Tag— Systems 

We will use the term “symbol” (or tape-symbol) to denote the elements of the 
TM’s alphabet, and the term “string” to denote words constructed with tape- 
symbols; terms “letter” and “word” will instead refer to the alphabet A of the 
simulated tag-system. We will set A = {oq, oi, 02 , . . . , a„}, the unique actual 
halting letter being ag; recall that the only case of halting corresponds to a word 
of length at least 2 whose first letter is oq. 

The initial tape of the simulating machine will be choosen in the form: 

r= P...S 



where: 

— dots denote zones of blanks; 

— the string S encodes (in a form we will detail in a moment) the word W to 
be processed; 

— the string V has the form: 



V = Pn+lPn---PlPo (1) 

where Pj encodes, in a suitable form, the production p(aj) (if we do not 
specify the index of a letter a G M, we will denote by P{a) the corresponding 
string in V). 

We will associate to any aj £ A a, positive integer Nj (again, if we do not specify 
the index of an a G M we will write N{a) instead of Nj). The definition of the 
function N will be given later on; for the moment let us say only that it must 
be injective. Once N has been choosen, we fix two tape-symbols p, v with p ^ v 
and we setQ 

J if W = 61&2 ■ • • is the initial word, . . 

\ we choose 5 = p . . . 

so that the exponent of v will specify (the function N being injective) which 
letter we are dealing with. 



^ the details can be found e.g. in the self-contained and very clear paper [Robj 
^ V stay for “normal” , p for “marker” 
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In particular let W start with a non-halting letter a, whose production is 
p{a) = in order to “elaborate” such a VU we must append to the 

right end of S the string H(a) = ^ ■ ^nd then kill the leftmost 

part of S (up to, and included, the second marker). In order to append S{a) it 
could be useful to have, somewhere on the left of the scanned zone, a “mirror 
image” if' (a) of S{a); thus it will be sufficient to copy each symbol of S' (a) at 
the end of 60 . 

The needed S' {a) will in fact be the rightmost part of P{a) (see formula m 
later on); the leftmost part of P{a) being a “super-marker”, signalling the end 
of S' (a). Of course, before copying the needed string, we must “reach” it; in 
other words, by using the initial part of S, we must “neutralize” the part 

of P which is on the right of P{a). This will be realized through a clever choice 
of the function N , which will be choosen in such a way that: 

{ if the initial letter of the word IT is a^, 

the corresponding initial string in S will force (3) 

the TM to “neutralize” Pq, Pi, .. . Pe-i- 

Such a neutralization will be the first part of the whole TM’s job; the second 
part will copy the string; the third phase will recover the neutralized part of P 
and will annihilate the part of S corresponding to the two initial letters of the 
starting word IT. 



3 A 7 X 4 UTM with 25 Commands 

We will choose as tape-symbols 0 (the blank) and a, (3, 7 . Concerning S, we will 
choose a as the normal symbol and 7 as the marker; concerning P, the normal 
symbol will be 0 and the marker will be /3; more precisely, a string like . . . 00/3 
will act as marker; a string like . . . /30/3 will act as super-marker. We will fix the 
form of P by setting: 

{ Pn+i = /3; P{a) = 0/3 for any halting letter a; 

for non halting letters, if p{a) = 61&2 ■ ■ - bk, we choose: (4) 

P{a) = 0/30^('"“)00/30^('"‘-i)00/3 . . . /30^('’=)00/30^('’i)00/3. 



Remark 1. In particular halting letters will all be encoded in P through the same 
string. This is not a problem, because virtual halting letters will never appear 
in first position in S, thus their representations in P do not need to differ each 
one other (but their codes in S do need to differ. . . ). On the other hand, if we 
sort A by putting all virtual halting letters at the end, we could suppress their 
representation in T% 

® S' {a) is read from right to left, thus we ask for the mirror image. Of course we do 
not need a true copy: e.g. both markers and normal symbols used in V could differ 
from the ones [p,, v) used in 5. . . 

^ but the leftmost part, P„+i = /3, is still needed 
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Remark 2. The neutralization of a Pj will simply consist in replacing each 0 
by an a. Let us assume that the needed neutralization has been done; and let 
P{a) be the rightmost non-neutralized part of V. The string P{a) ends with 
a substring like /30“^00/3; its final part 00/3 will act as a first marker. Meeting 
such a marker, the TM will change it into aa(3 and will append a marker 7 on 
the right of S. Coming back, the TM will encounter f30^aaf3 and each 0 if anjj^ 
will be changed into an a and that will force the TM to append an a. The work 
continues with the next (on the left) string like /30^00/3; and so on, until we reach 
(instead of a /30^00/3) the super-marker /30/3 (the leftmost (3 being the rightmost 
symbol of the closer P on the left, possibly Pn+i); this ends the second step of 
the work, and the restore phase will start. 

Now we need an explicit formula for the function N; we will choose: 

No:=0;Nk-.= Nk-i + l+\p{ak)\ (fc = l,...,n) (5) 

where |X| denotes the length of the word X; remark that the “+1” ensures the 
injectivity of the function N. 

Remark 3. In particular, see 0 , the actual halting letter oq will be represented 
in S by an empty-string; its presence in the middle of W will correspond to a 
substring 77 in 5; the presence of cq in first (resp. last) position will correspond 
to the fact that the string S starts (resp. ends) with a 7 . Halting-words in the 
tag-system thus correspond to S starting with a 7 ; in such a case the TM will 
simply halt. 

With the choices @ and (0 the reader will have no difficulties to check that 
property m holds if, in the first phase of the TM’s work, each a of the initial 
part in S neutralizes up to and included the first substring 0/3 on its left; 

thus the initial TM’s can be regulated by the following piece of table where, in 
first column, we indicate by arrows the direction the head is moving to: 



Table 1. The first part of the TM’s work. 



Direction 


State 


0 


a 


(3 


7 


Description 




A 


□ 


OCB 


□ 


Halt 


The whole TM’s work 
starts here 




B 


OCB 


OCB 


pcc 


□ 


Nentralization in V 




C 


aTZD 


OCB 


□ 


□ 


Doubt 




D 


alZD 


OCB 


I3TID 


aCE 


Neutralization in S 



® the letter ao corresponds to a: = 0; see the following formula 0 
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Remark 4- We want to point out that cells marked as refer to cases we will 
never encounter if the initial tape corresponds to the simulation of a tag-system; 
in particular we could fill them with any command without affecting the work 
(and the universality) of our TM. On the contrary, the cell marked “Halt” must 
remain empty: the machine must halt if the string S starts with the marker 7 
(see RemEJ- 

Concerning the second step of the TM’s work almost any detail has been 
given in Rem 12 the corresponding piece of table is given by: 



Table 2. The second part of the TM’s work. 



Direction 


State 


0 


a 


(3 


7 


Description 




E 


aJZI 


aCE 


f3£.F 


'yCE 


searches leftward 




F 


aCG 


aCE 




□ 


doubt 




G 


aUH 




[3TIJ 


□ 


decided 




H 


jCE 


aTLH 


(3TLH 


-yUH 


appends 7 




I 


aCE 


am 


pm 


777/ 


appends a 



and the third step requires just one line: 

Table 3. The third part of the TM’s work. 



Direction 


State 


0 


a 


P 


7 


Description 


- 


J 


□ 


077J 


pnj 


aTZA 


Restore 



We promised a 7 x 4 UTM and we wrote a 10 x 4 table; however, according 
with RemEl nothing forbids to suppress state C, by replacing anywhere a switch 
to state C with a switch to state D; two other similar groupments give raise to 
a 7 X 4 UTM; see Table 0. Let us point out the major differences with respect to 
the 7x4 UTMs of Minsky jMin) . Rogozhin jbo2j and Robinson |Robj . In these 
three TMs: 

— the super-marker is given by a couple (3(3 of markers; 

— almost each run of the machine swaps the markers {(3 i — > 7 ) . 
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We choosed as the super-marker the string /30/3; this choice, and the fact that 
we never swap the markers, allows to gain one empty cell in the table (column 
7 of row B] the swap of the markers would require to use such a cell for other 
purposes) . On the other hand, our UTM halts exactly where the simulated tag- 
system would stop: when the machine halts, the output (to be interpreted in 
terms of tag-systems) starts from the 7 scanned by the Head and ends on the 
first blank on the right. 



Table 4. A 7 x 4 UTM with 25 commands and easy recover of the output. 



Old names 


New names 


0 


a 


/3 


7 


A,B 


T 


0£T 


QCT 


pcu 


Halt 


C,D 


U 


a7^^7 


QCT 


pnu 


aCV 


E 


V 


aTZZ 


aCV 


I3CW 


7£U 


F 


w 


aCX 


aCV 






G,J 


X 


aTLY 


QTZX 


pnx 


07^^ 


H 


Y 


-yCV 


aUY 


pnY 


77^T 


I 


Z 


aCV 


aTZZ 


pnz 


77^Z 



4 A 10 X 3 UTM 

We come back to the original 10 states of Tables[H 0 and El because, with respect 
to Table El some different groupment will be needed. In order to work with only 
3 tape-symbols we will now replace the marker 7 by the string /3/3; this choice 
(say “the marker in S is twice the marker in P”) was already used by Rogozhin 
in its 10 X 3 UTM0. Concerning row G in Table El it is sufficient to suppress the 
(empty) column 7 ; concerning row A in Table 0 we just need to (suppress the 
column 7 and) put the Halt in column /3; also for row / in Table El we confined 
ourselves to suppress the column 7 (thus row I will, as needed, leave unchanged 
the strings Pf3); concerning rows E,F of Table 0we just did a little change: we 
suppressed of course column 7 in both rows, but the (empty) column /3 in row 
F needs now to be filled with (3TZF (thus accomplishing the work of column 7 
in the old row F). All remaining rows require many steps (and possibly many 
states; e.g. state H splits into H,Hi,H 2 ) for the simulation. 



see [R o2j . where our symbols 0, a, (3 are denoted by 0, 1, 6 respectively 
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Table 5. States without horizontal separating line can be grouped together, thus 
getting a 10 X 3 UTM. 



State 


0 


a 


P 


E 


aTZI 


aCE 


PCF 


F 


aCG 


aCE 


PCE 


H 2 


□ 


□ 


PCE 


G 


allH 


□ 


07^Jl 


J 


□ 


07^J 


07^Jl 


Ji 


□ 


aCJ 2 


07^H 


J 2 


(5TU 


□ 


□ 


H 


PTLHi 


aTLH 


PTIH 


I 


ajCE 


aTZI 


PTZI 



State 


0 


a 


p 


A 


□ 


OCB 


Halt 


Hi 


PCH 2 


□ 


□ 


B 


OCB 


OCB 


PCG 


C 


aTZD 


OCB 


□ 


D 


aTZD 


OCB 


PTZDi 


Di 


aTZD 


□ 


aCD 2 


D 2 


□ 


□ 


aCD^ 


D 3 


□ 


otCE 


□ 



Remark 5. Row Hi could be grouped both with J or with i? 2 , but it is more 
convenient to group it with row A, and to absorbe row H 2 into (the new form of) 
row F. The most tricky simulations concern column (3 in row J, where we wrote 
a 0; the choice is not very natural, but allows to treat any case by just one more 
state (the union of J\ and J 2 ). Concerning row G, as already said, it could be 
treated by simply suppressing the column 7 ; however nothing forbids of apply 
to it the same treatment used for row J: in column (3 we can write (YR-Ji. The 
choice, already not so natural in the treatment of row J, can here seem absurd. 
However it works: the head, in state Ji, will meet an a that forces the TM to 
come back and write the needed (3. In such a way row G can be joined to row 
J (because we did not join J and Hi); and row H\ can be joined to row A thus 
giving raise to the 10 x 3 UTM of Table 0 



Remark 6. Concerning the structure of the whole machine, we find our TM 
somewhat simpler than the Rogozhin’s one: we do not need to differentiate the 
treatment for even-length and odd-length productions; and the couple (3j3 acts 
simply as a markeiQ. 



^ in the Rogozhin’s UTM the second (3 of the couple acts as “first normal symbol a 
of the following block”; this operation being compensated by the fact that, when 
writing the marker jSP on the right of S, the leftmost j3 of the couple overwrites the 
last a already written. . . 
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5 A 19 X 2 UTM 

A fundamental result of Shannon |Sha] states that any Turing Machine with m 
states and n symbols can be simulated by a 2-symbols TM with m ■ 4>{n) states, 
where the function 4> is given by: 

let I be minimal such n < 2^; then (j>{n) = 3 • 2* + 2Z — 7. 



Remark 7. The Shannon result is the first step for the construction of small 
UTMs; the second step being the fact that any 2-symbols TM can be simulated 
by a tag system, and the third one being that (as we shortly explained) tag 
systems can be simulated by a small Turing Machine. Details can e.g. found in 
|Hobf . where is also given an easier proof (at the expenses of a worst value for 
(j>) of the Shannon’s result. 

The first step in the Shannon’s technique transforms the original tape by replac- 
ing each old symbol with its /-digits binary description@. If applied to 7 x 4 TMs 
the Shannon’s formula gives the existence of a simulating 63-states TM; perhaps 
the factor ^(4) = 9 in the Shannon’s formul^, one can expect that better values 
should be obtained by using some peculiarity of the table (empty cells, repeated 
commands,. . . ); in fact, applied to the Robinson’s UTM, it can be lowered to a 
(quite astonishing) value close to 3: we proved in piaij that such a UTM can be 
simulated by a 22 x 2 TM0. 

Also our 7x4 UTM can be simulated by a 22 x 2 TM; however, in order 
to prove a better result, we will follow a different strategy, using variable length 
codes: the marker (3 will be replaced by a one-digit symbol, while a, 7 and 0 will 
be replaced by two-digits symbols. 

With respect to the original 10 states (see Tables D 0 and 01) we will “dupli- 
cate” the old symbol a, by replacing it somewhere with an 7^ and somewhere 
with an tr, thus getting a 10 x 5 table; the replacement being done in such a 
way that arrrows over the a denote the direction along which the Head moves 
when it arrives to scan such a symbol. It is not a difficult task: in the initial tape 
the symbol a appears only in S (thus it will be scanned “coming from the left”) 
and we replace it by ; in the table commands like aTZK must be replaced by 
while commands like aCK must be replaced by itCK. The result is 
given in Table El 

Remark 8. In row J column 7 of Table El we wrote “tF77.A” instead of “072.A”, 
as in Table 01 it is quite obvious that this does not affect the behaviour of the 
UTM. In fact, concerning the next symbol to scan, we have two cases: it can be 
a 7 , or a . In the first case the machine will halt (thus the tr we wrote has 
no relevance); in the second case the machine will come back in state B, where 
symbols and 0 are treated in the same way. 

® of course initial zeros can not be suppressed! 

® factor which is sharp in the framework of the simulation of any 4-symbols TM 
a 22 X 2 UTM was also obtained, by direct construction, in Rogozhin [Ro.'H 
when the symbol we are writing will be read again, we will arrive to it moving left! 
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Table 6. An intermediate 10 x 5 table. 



State 


0 






/3 


7 


A 




QCB 


□ 


□ 


Halt 


B 


OCB 


□ 


QCB 


PCG 


□ 


C 


isTLD 


□ 


QCB 


□ 


□ 


D 


^TZD 


QCB 


□ 


/3TZD 


~dCE 


E 


ixTLI 


□ 


l^CE 


PCF 


'yCE 


F 


~dCG 


□ 


~dCE 


□ 


□ 


G 


^TLH 


□ 


□ 


PIIJ 


□ 


H 


jCE 


^TZH 


□ 


/3TZH 


'jTZH 


I 


~dCE 


i^TLl 


□ 


pni 


'yTLI 


J 


□ 


QTU 


□ 


I3TU 


^TZA 



With respect to Table |H| let us now rename the five symbols 0, 7^, tr, (3 
and 7 by "00", "01", "10", "1" and "11" respectively; the next step consists in 
interpreting such new symbols as a single-digit symbol (the "1") or as 2-digit 
symbols (the remaining one J^ . Let us point out the main reason for our choice 
of variable-length codes: independently from the direction we are walking in, 
when we scan a 0 we are sure it is the starting of a normal symbol; conversely, 
when scanning a 1, it must be a marker (or possibly the beginning of a marker). 
With such a remark in mind, it is nothing but a tedious work to check that 
Table 0 simulates Table 0 

Let us just add a few words about the treatment of state G. In such a state 
we can meet only a 1 or a 00; in the first case we must (and we do, with the 
command VJZJ) leave the 1 unchanged, going right in state J; in the second case 
we should replace 00 by 10, going right in state H; the command ICH we wrote, 
after two more TM’s steps, has exactly this effect. 
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Table 7. States in the rightmost part can be replaced as indicated; joining states C 
and Ji we get a 19 x 2 UTM. 



State 


0 


1 


State 


0 


1 


State 


0 


1 


A 


17^Al 


Halt 








Ai -s- Di 


□ 


QCBi 


B 


OjCBi 


ICC 


Bi 


OCB 


QCB 








C 


OCCi 


□ 


Cl 


mc 2 


0 

hi 

to 


C 2 Di 


07^^) 








D 


nZDi 


nzD 2 


Di 


OIZD 


OCBi 














D 2 


moi 


ICD 3 


D 3 El 


□ 


OCE 


E 


ICEi 


1 CE 3 


El 


11 ZE 2 


OCE 


E 2 II 


□ 


QIZI 








E 3 


ICFi 


ICE 














El 


OCG 


OCE 


F Es 


ICFi 








G 


ICH 


mj 














H 


nzHi 


nzH 


Hi 


ICH 2 


OIZH 


H 2 — >■ E 3 


□ 


ICE 


I 


mil 


mi 


h 


1 CI 2 


QIZI 


I 2 — >■ El 


□ 


OCE 


J 


07^Jl 


11 ZJ 2 


Ji 


□ 


07^J 














J 2 


07^Jl 


07^A 
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Abstract. We present a biological computing paradigm that is based on 
genetic regulatory networks. Using a model of gene expression by piece- 
wise linear differential equations we show that the evolution of protein 
concentrations in a cell can be considered as a process of computation. 
This is demonstrated by showing that this model can simulate mem- 
ory bounded Turing machines. The simulation is robust with respect 
to perturbations of the system, an important property for both analog 
computers and biological systems. 



1 Introduction 

In recent years scientists have been looking for new paradigms for constructing 
computational devices. These include quantum computation PJ, DNA compu- 
tation P), neural networks j2], neuromorphic engineering P] and other analog 
VLSI devices. This paper describes a new paradigm based on genetic regulatory 
networks. The concept of a “genetic network” refers to the complex network of 
interactions between genes and gene products in a cell 0. Since the 60’s genetic 
regulatory systems are thought of as “circuits” or “networks” of interacting com- 
ponents 0, and were described in computer science terms: The genetic material 
is the “program” that guides protein production in a cell; protein levels deter- 
mine the evolution of the network at subsequent times, and thus serve as its 
“memory” . This analogy between computing and the process of gene expression 
was pointed out in various papers I3HI Bray suggests that protein based circuits 
are the device by which unicellular organisms react to their environment, instead 
of a nervous system 0. However, until recently this was only a useful metaphor 
for describing gene networks. The papers [911 Oj describe the successful fabrica- 
tion of synthetic networks, i.e. programming of a gene network. In this paper 
we compare the power of this computational paradigm with the standard digital 
model of computation. In a related series of papers it is shown both theoretically 
and experimentally that chemical reactions can be used to implement Boolean 
logic and neural networks (see [I Ij and references therein). 

Protein concentrations are continuous variables that evolve continuously in 
time. Moreover, biological systems do not have timing devices, so a description in 
terms of a map that simultaneously updates the system variables is inadequate. It 

M. Margenstern and Y. Rogozhin (Eds.): MCU 2001, LNCS 2055, pp. 11-|2^ 2001. 

(c) Springer- Verlag Berlin Heidelberg 2001 
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is thus necessary to model gene networks by differential equations, and consider 
them as analog computers. The particular model of genetic networks we analyze 
here assumes switch-like behavior, so that protein concentrations are described 
by piecewise linear equations (see equation (Pi below) |12I8| . but we believe our 
results to hold for models that assume sigmoid response (see discussion). These 
equations were originally proposed as models of chemical oscillations in biological 
systems m- In this paper we make the analogy between gene networks and 
computational models complete by formulating an abstract computational device 
on the basis of these equations, and showing that this analog model can simulate 
a computation of a Turing machine M- The relation between digital models of 
computation and analog models is explored in a recent book PS!, mainly from 
the perspective of neural networks. It is shown there that analog models are 
potentially stronger than digital ones, assuming an ideal noiseless environment. 
In this paper on the other hand we consider the possibility of noise and propose a 
design principle which makes the model equations robust. This comes at a price: 
we can only simulate memory bounded Turing machines. However, we argue that 
any physically realizable robust simulation has this drawback. On the subject 
of computation in a noisy environment see also jltill Y) . We found that the gene 
network proposed in 0 follows this principle, and we quote from that paper 
their statement that “theoretical design of complex and practical gene networks 
is a realistic and achievable goal” . 

Computation with biological hardware is also the issue in the field of DNA 
computation |2j. As a particular example, the guided homologous recombination 
that takes place during gene rearrangement in ciliates was interpreted as a pro- 
cess of computation m- This process, and DNA computation in general, are 
symbolic, and describe computation at the molecular level, whereas gene net- 
works are analog representations of the macroscopic evolution of protein levels 
in a cell. 



2 Piecewise Linear ODEs for Gene Networks 

In this section we present model equations for gene networks, and note a few 
of their dynamical properties P2|. The concentration of N proteins (or other 
biochemicals in a more general context) is given by N non-negative real variables 
yi, ■ ■ ■ ,Vn- Let 9i, ... ,0^ be N threshold concentrations. The production rate 
of each protein is assumed to be constant until a threshold is crossed, when the 
production rate assumes a new value. This is expressed by the equations: 

^ = -kiy^ + Ai{Yi, . . . ,Yn) , ( 1 ) 

where Yi is a Boolean variable associated with equal to 1 if > 9i and 0 
otherwise; ki is the degradation rate of protein i and Ai is its production rate 
when gene i is “on”. These equations are a special case of the model of Mestl et. 
al. [El where each protein can have a number of thresholds, compared with just 
one threshold here (see also jZDl). When there is just one threshold it is easy to 
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associate Boolean values with the continuous variables. For simplicity we take 
fci = 1, and define 

Xi = yi- 0i, 

with the associated Boolean variables 



Xi = sgn{xi), 

where sgn(a;) = 1 for a; > 0 and zero otherwise. Also denote Ai(Xi, . . . ,Xn) = 
Ai{Yi, . . . , Yjq) — 9i. Equation (Pi now becomes: 

dx ■ 

-^ = -Xi + A,{Xi,...,XN), (2) 

and Ai is called the truth table. The set in which corresponds to a particular 
value of a Boolean vector X = {X\, . . . ,Xn) is an orthant of By abuse 
of notation, an orthant of IR^ will be denoted by a Boolean vector X. The 
trajectories in an orthant are straight lines directed to a focal point A{X) = 
(Ai(A), . . . , ylAf(X)), as seen from equation (0) below. If the focal point A{X) at 
a point X = {x\, . . . , xn) is in the same orthant as x, then the dynamics converges 
to the focal point. Otherwise it crosses the boundary to another orthant, where 
it is redirected to a different focal point. The sequence of orthants X(l), A(2), . . . 
that correspond to a trajectory x(t) is called the symbolic dynamics of the vector 
field. In the next sections we will associate the symbolic dynamics of a a model 
Gene Network (GN) with a process of computation. 



2.1 Dynamics 

In an orthant all the Ai are constant, and the equations 0 are easily integrated. 
Starting from a point x(0), 

Xi(t) = Xi + (xj(0) - Ai)e“‘ , (3) 



where Ai = Ai(Ai(0), . . . ,Xiv(0)). The time ti at which the hyper-plane Xi = 0 
is crossed is given by 



ti = In 



Aj - Xi(0) 

A, 



The switching time tg is the time it takes to reach a new orthant: 



ta = min ti 

i 

We will consider networks with Ai = ±1 in all orthants. Thus when the focal 
point is in a different orthant than x(0) we have that 

G < In 2 . (4) 

This gives a criterion for determining whether a GN is converging to a fixed point: 
if the time from the last switching time is larger than In 2, then the system is 
converging to the current focal point. 
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Fig. 1. A piecewise linear flow in two dimensions. 



A network as in 0 is a continuous time version of a discrete network: 

Zi sgn{Ai{Z)) , (5) 

where Z G {0, l}'^ is a vector of Boolean variables. This dynamics does not 
necessarily visit the same sequence of orthants as the corresponding continuous 
network. 

We want to simulate a discrete dynamical system (Turing machine) with a 
continuous time GN. To bridge the gap, we first construct a discrete gene network 
of the form whose corresponding continuous GN has the same symbolic 
dynamics. We will use truth tables with the following property: 

Definition 1. Two orthants are said to be adjacent if they differ in exactly one 
coordinate. A truth table A will be called adjacent if A(X) is in the same orthant 
as X or in an adjacent orthant to X for all x G A network with an adjacent 
truth table will also be called adjacent. 

We note that in an adjacent network, all initial conditions in an orthant lead 
to the same adjacent orthant. Therefore all initial conditions of © in the same 
orthant have the same symbolic dynamics. An immediate result is the following 
lemma: 

Lemma 1. Let A be an adjacent truth table, then the symbolic dynamics of 
a continuous GN is the same as the dynamics of its discrete counterpart ©).' 
X{k) = Z(k), fc = 0, 1, . . . , where Z{k) is the kth iterate of (0 and X{k) is the 
kth orthant visited by ©), for every initial condition of corresponding to Zq. 
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In view of the above discussion, the focal points and the initial condition of 
an adjacent network can be taken as points in {—1,1}^, and the problem of 
constructing a continuous network that simulates a discrete dynamics is reduced 
to the problem of constructing a discrete network that changes only one variable 
at each iteration. 

When the truth table is not adjacent, a discrete network may show qual- 
itatively different dynamics than its continuous counterpart: continuous high- 
dimensional GN’s are “typically” chaotic I21I22I . However chaos is a dynamical 
behavior that is impossible in dynamical systems with a finite state space. The 
lack of sensitivity of the dynamics of an adjacent GN to the placement of the 
initial condition, and its equivalence to a discrete network leads to the following 
statement: 

Corollary 1. Adjacent networks are not chaotic. 

This implies a measure of stability to the dynamics of adjacent networks. Lack of 
sensitivity to perturbations of the system is captured by the following property 
of adjacent networks: 

Claim. Let A be an adjacent truth table, and let H be a truth table whose entries 
are Ai{X) + Si{X), where Si{X) S [— c, c] for some 1 > c > 0. Then A is also 
adjacent, and the two networks have the same symbolic dynamics. 

The robustness of adjacent networks is an important property for both analog 
computers and biological systems. The robustness of biological systems leads us 
to speculate that adjacency might be a principle underlying the robust behavior 
in the modeled systems. Adjacency can also be taken as a principle which can 
be used in the design of robust networks. 

3 Preliminaries 

In this section we give the definition of the Turing machine that will be simulated 
and provide the relevant concepts from complexity theory HH. 

Definition 2. A Turing machine is a tuple M = {K, X, r,6,Qi,Qq). K = 
{Qi, . . . ,Qq} is a finite set o/ states; Qi,Qq € K are the initial/halting states 
respectively; S is the input alphabet; T = E U {blank, is the tape alphabet 
which includes the blank symbol and the left end symbol, ff, which marks the 
end of the tape; S : K x T K x S x {L, i?} is the transition function, and 
L/R signify left/right movement of the read-write head. The transition function 
is such that it cannot proceed left of the left end marker, and does not erase it, 
and every tape square is blank until visited by the read-write head. 

At the beginning of a computation an input sequence is written on the tape 
starting from the square to the right of the left-end marker. The head is located 
at the leftmost symbol of the input string, and the finite-control is initialized at 
its start state Q\. The computation then proceeds according to the transition 
function, until reaching the halting state. Without loss of generality we suppose 
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that the input alphabet, S, is {0,1}. We say that a Turing machine accepts an 
input word w if the computation on input w reaches the halting state with “1” 
written on the tape square immediately to the right of the left-end marker. If 
the machine halts with “0” at the first tape square, then the input is rejected by 
the machine. Other conventions of acceptance are also common. The language 
of a Turing machine M is the set L{M) of strings accepted by M. 

Languages can be classified by the computational resources required by a 
Turing machine which accepts them. The classification can be according to time 
or space (memory). The time complexity of a computation is the number of 
steps until halting and its space complexity is the number of tape squares used 
in the computation. The complexity classes P and PSPACE are defined to be 
the classes of languages that are accepted in polynomial time and polynomial 
space, respectively. More formally, a language L is in PSPACE if there exists 
a Turing machine M which accepts L, and there exists c > 0 such that on all 
inputs of length n M accesses 0{rf) tape squares. To make a finer division one 
defines the class SPACE(s(n)) of languages that are accepted in space s{n). 

3.1 On the Feasibility of Turing Machine Simulation 

Turing machine simulations by differential equations appear in a number of pa- 
pers: in it was shown that an ODE in four dimensions can simulate a Turing 
machine, but the explicit form of the ODE is not provided. Finite automata 
and Turing machines are simulated in m by piecewise constant ODEs. Their 
method is related to the one used here. 

A Turing machine has a countably infinite number of configurations. In order 
to simulate it by an ODE, an encoding of these configurations is required. These 
can essentially be encoded into a continuum in two ways: 

— in a bounded set, and two configurations can have encodings that are arbi- 
trarily close; 

— in an unbounded set, keeping a minimal distance between encodings. 

In the first possibility, arbitrarily small noise in the initial condition of the sim- 
ulating system may lead to erroneous results, and is therefore unfeasible (this 
point is discussed in in the context of simulating a Turing machine by a map 
in IR^). The second option is also unfeasible since a physical realization must be 
finite in its extent. Thus simulation of an arbitrary Turing by a realizable analog 
machine is not possible, and simulation of resource bounded Turing machines is 
required. In this chapter we will show a robust simulation of memory bounded 
Turing machines by adjacent GN’s. 

4 Computing with GN’s 

In this section we formulate GN’s as computational machines. The properties of 
adjacent networks suggest a natural interpretation of an orthant A as a robust 
representation of a discrete configuration. The symbolic dynamics of a network. 
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i.e. the values X{t) will be interpreted as a series of configurations of a discrete 
computational device. 

Next we specify how it receives input and produces an output. A subset of 
the variables will contain the input as part of the initial condition of the network, 
and the rest of the variables will be initialized in some uniform way. Since we 
are considering E = {0, 1}, the input is encoded into the binary values A(0). To 
specify a specific point in the orthant X we choose -1 to correspond to 0 and 1 to 
correspond to 1. Note that for adjacent networks any value of x(0) corresponding 
to A(0) leads to the same computation, so this choice is arbitrary. 

In addition to input variables, another subset of the variables is used as out- 
put variables. For the purpose of language accepting a single output variable is 
sufficient. There are various ways of specifying halting. One may use a dynam- 
ical property, namely convergence to a fixed point, as a halting criterion. We 
noted that convergence to a fixed point is identified when after a time In 2 no 
switching has occurred (see equation 0) ) . Non-converging dynamics correspond 
to non-halting computations. While such a definition is natural from a dynamics 
point of view, it is not biologically plausible, since a cell evolves continuously, 
and convergence to a fixed point has the meaning of death. Another approach 
is to set aside a variable that will signify halting. Let this variable be Xjy. It 
is initially set to 0, and when it changes to 1, the computation is complete, 
and the output may be read from the output variables. In this case non-halting 
computations are trajectories in which Xjy never assumes the value 1. After Xjy 
has reached the value 1, it may be set to 0, and a new computational cycle may 
begin. A formal definition of a GN as a computational machine is as follows. 

Definition 3. A genetic network is a tuple G = (V, /, O, A, Xq, Xff), where V is 
the set of variables indexed by {1, . . . , N}; I QV , \I\ = n and O CV, |0| = m 
are the set o/input and output variables respectively; A : {0, 1}'^ — >■ {—1, 1}'^ is 
a truth table for a flow m; Xq G {—1,1}^ " is the initialization of variables in 
V\I. X]\j is the halting variable that is initialized to 0. A computation is halted 
the first time that Xpf = 1. 

A GN G with n input variables and m output variables computes a partial 
mapping 

/G:{o,ir^{o,ir, 

which is the value of Xi for i G O when the halting state Xjsi = 1 is reached. If 
on input w the net does not reach a halting state then fciw) is undefined. 

We wish to characterize the languages accepted by GN’s. For this purpose 
it is enough to consider networks with a single output variable, and say that a 
GN G accepts input w G {0, 1}* if /g(w) = 1. A fixed network has a constant 
number of input variables. Therefore it can only accept languages of the form 

L„ = Ln{ 0 ,ir, (6) 

where L C {0, 1}*. To accept a language which contains strings of various lengths 
we consider a family {G„}^i of networks, and say that such a family accepts a 
language L if for all n, L(Gn) = An- 
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Before we define the complexity classes of GN’s we need to introduce the ad- 
ditional concept of uniformity. A GN with N variables is specified by the entries 
of its truth table A. This table contains the 2^ focal points of the system. Thus 
the encoding of a general GN requires an exponential amount of information. 
In principle one can use this exponential amount of information to encode every 
language of the form LC\ {0, 1}”, and thus a series of networks exists for every 
language L C {0, 1}*. Turing machines on the other hand, accept only the subset 
of recursive languages E]. A Turing machine is finitely specified, essentially by 
its transition function, whereas the encoding of an infinite series of networks is 
not necessarily finite. To obtain a series of networks that is equivalent to a Turing 
machine it is necessary to impose the constraint that truth tables of the series 
of networks should all be created by one finitely encoded machine. The device 
which computes the truth table must be simple in order to demonstrate that 
the computational power of the network is not a by-product of the computing 
machine that generates its transition function, but of the complexity of its time 
evolution. We will use a finite automaton with output I2S1, which is equivalent to 
a Turing machine which uses constant space. The initial condition of the series 
of networks needs also to be computable in a uniform way, in the same way as 
the truth table, since it can be considered as “advice”, as in the model of advice 
Turing machines M We now define: 

Definition 4. A family of networks is called uniform if there exist 

constant memory Turing machines Mi, M 2 that compute the truth table and 
initial condition as follows: on input X G {0, 1}" Mi outputs A{X); M 2 outputs 
xq for Gn on input 1” . . 

With the definition of uniformity we can define complexity classes. Given a 
function s(n), we define the class of languages which are accepted by networks 
with less than s(n) variables (not including the input variables): 

GN{s{n)) = {L C {0, 1}* jthere exists a uniform family of networks 

s.t. L(G„) = Ln and N < s{n) + n} 

The class with polynomial s(n) will be denoted by PGN. The classes Adjacent- 
GN(s(n)) and Adjacent-PGN of adjacent networks are similarly defined. Time 
constrained classes can also be defined. 

5 The Computational Power of GN’s 

In this section we outline the equivalence between memory-bounded Turing ma- 
chines and networks with adjacent truth tables. We begin with the following 
lemma: 

Lemma 2. Adjacent-GN{s{n)) C SPACE{s{n)). 

Proof. Let L G Adjacent-GN(s(n)). There exists a family of adjacent GN’s, 
{GnhOLi with truth tables 4^”^ such that L(G„) = L„, and constant memory 
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Turing machines Mi, M 2 that compute and Xq. Since is an adjacent 
truth table, and we are only interested in its symbolic dynamics, we can use 
the corresponding discrete network. The Turing machine M' for L will run M\ 
to obtain the initial condition, and then run M 2 to generate the iterates of the 
discrete network. M' will halt when the network has reached a fixed point. The 
network Gn has s(n) variables on inputs of length n, therefore the simulation 
requires at least that much memory. It is straightforward to verify that 0(s(n)) 
space is also sufficient. 

The Turing machine simulation in the next section shows: 

Lemma 3. Let M be a Turing maehine working in spaee s{n), then there exists 
a sequence of uniform networks {Gn}ff^i with s{n) variables such that L{Gn) = 
L(M)n{0,l}". 

We conclude: 

Theorem 1. Adjaeent-GN{s{n)) =SPACE{s{n)) . 

As a result of claim |^] we can state that adjacent networks compute robustly. 
This is unlike the case of dynamical systems which simulate arbitrary Turing 
machines, where arbitrarily small perturbations of a computation can corrupt the 
result of a computation (see e.g. f2AlA/i\h\ ). The simulation of memory bounded 
machines is what makes the system robust. The above theorem gives only a lower 
bound on the computational power of the general class of GN’s, i.e.: 

Corollary 2. SPACE{s{n)) C GN{s{n)). 

One can obtain non-uniform computational classes in two ways, either by allow- 
ing advice to appear as part of the initial condition, or by using a weaker type 
of uniformity for the truth table. This way one can obtain an equivalence with 
a class of the type PSPACE/poly fH) . 

6 Turing Simulation by GN’s 

Since the discrete and continuous networks associated with an adjacent truth 
table have the same symbolic dynamics, it is enough to describe the dynamics of 
an adjacent discrete network. We show how to simulate a space bounded Turing 
machine by a discrete network whose variables represent the tape contents, state 
and head position. An adjacent map updates a single variable at a time. To 
simulate a general Turing machine by such a map each computational step is 
broken into a number of operations: updating the tape contents, moving the 
head, and updating the state; these steps are in turn broken into steps that can 
be performed by single bit updates. 

Let M be a Turing machine that on inputs of length n uses space s. With- 
out loss of generality we suppose that the alphabet of the Turing machine is 
E = {0,1}. To encode the three symbols {0,1, blank} by binary variables we 
use a pair of variables for each symbol. The first variable of the pair is zero 
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iff the corresponding tape position is blank; “0” is encoded as “10” and “1” is 
encoded as “11”. Note that the left end marker symbol need not be encoded 
since the variables of the network have numbers. We construct a network G 
with variables Yi, . . . , Bi , . . . , Pi , . . . , Pg', Qi, ■ . ■ ,Qq and auxiliary vari- 
ables B,Y,Q'i, . . . , Q'g, Cl, ... , C4. The variables 

Yi, ■ ■ ■ ,Yg', Bi, . . . , Bg 

store the contents of the tape: B^ indicates whether the square i of the tape 
is blank or not and Yi is the binary value of a non-blank square. The input 
is encoded into the variables Yi, . . . , Y„, B\, . . . , i?„. Since the Turing machine 
signifies acceptance of an input by the value of its left-most tape square, we take 
the output to be the value of Yi . The position of the read- write head is indicated 
by the variables 

Pi,---,Pg 

If the head is at position i then Pi = 1 and the rest are zero. The state of the 
machine will be encoded in the variables 

Qi, ■ . ■ , Qq 

where Qi is the initial state and Qq is the halting state of the machine. State i 
will be encoded by Qi = 1 and the rest zero. 

As mentioned above, a computation of the Turing machine is broken into a 
number of single bit updates. After updating variables related to the state or the 
symbol at the head position, information required to complete the computation 
step is altered. Therefore we need the following temporary variables 

— Y,B - the current symbol 

— Q'l, . . . ,Q'g - a, copy of the current state variables. 

A computation step of the Turing machine is simulated in four stages: 

1. Update the auxiliary variables Y, B,Qi, . . . ,Q'^ with the information re- 
quired for the current computation step; 

2. Update the tape contents; 

3. Move the head; 

4. Update the state. 

We keep track of the stage at which the simulation is at with a set of variables 
Cl , . . . , C4 which evolve on a cycle which corresponds to the cycle of operations 
(l)-(4) above. Each state of the cycle is associated with an update of a single 
variable. After a variable is updated the cycle advances to its next state. However, 
since a variable update does not always change the value of a variable, e.g. the 
machine does not have to change the symbol at the head position, and since the 
GN needs to advance to a nearby orthant or else it enters into a fixed point each 
update is of the form: 

If the variable pointed to by the cycle 
subnetwork needs updating - update it 
else 

advance to the next state of the cycle 
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Suppose that the head of M is at position i and state j, and that in the current 
step Yi is changed to value which is a non-blank, the state is changed to state 
k, and the head is moved to position z -|- 1. The sequence of variable updates 
with the corresponding cycle states is as follows: 

cycle state variable update 

0000 Y ^Yi 

0001 B ^ Bi 

0011 Q' ^ 1 

0111 Yi ^ Y^ update variable at head 

0110 Bi -<—1 Y^ non-blank 

1110 Pi+i ^ 1 new position of head 

1111 Pi ^0 erase old position of head 

1011 Qk ^1 new state 

1001 Qj t— 0 erase old state 

1000 Qj t— 0 prepare for next cycle 

At the end of a cycle a new cycle is initiated. 

On input w = W\W 2 ■ ■ • Wn, Wi € {0, 1} the system is initialized as follows: 

Yi = w^, i = I,..., s 

Bi = 1, i = 1, ... ,s 

Pi = 1 

<5i = 1 

all other variables: 0 

This completes the definition of the network. The computation of the initial 
condition is trivial, and computing the truth table of this network is essentially a 
bit by bit computation of the next configuration of the simulated Turing machine, 
which can be carried out in constant space. The network we have defined has 
0(s) variables, and each computation step is simulated by no more than 20 steps 
of the discrete network. □ 

Remark 1. It was pointed out that the Hopfield neural network is related to 
GN’s pH). Theorem n can be proved using complexity results about asymmetric 
and Hopfield networks found in j29i;i0] . However, in this case it is harder to 
define uniformity, and the direct approach taken here is simpler. 



7 Language Recognition vs. Language Generation 

The subclass of GN’s with adjacent truth-tables has relatively simple dynamics 
whose attractors are fixed points or limit cycles. It is still unknown whether non- 
adjacent GN’s are computationally stronger than adjacent GN’s. General GN’s 
can have chaotic dynamics, which are harder to simulate with Turing machines. 
We have considered GN’s as language recognizers: where the input arrives at 
the beginning of a computation, and the decision to accept or reject is based on 
its state when a halting state is reached. In the context of language recognition. 



22 



Asa Ben-Hur and Hava T. Siegelmann 



chaotic dynamics does not add computational power: given a chaotic system that 
accepts a language, there is a corresponding system that does not have a chaotic 
attractor for inputs on which the machine halts; such a system is obtained e.g., by 
defining the halting states as fixed points. However, one can also consider GN’s 
as language generators by viewing the symbolic dynamics of these systems as 
generating strings of some language. In this case a single GN can generate strings 
of arbitrary length. But even in this case chaos is probably not very “useful”, 
since the generative power of structurally stable chaotic systems is restricted to 
the simple class of regular languages m More complex behavior can be found 
in dynamical systems at the onset of chaos (see Id2lddl ) . Gontinuously changing 
the truth table can lead to a transition to chaotic behavior m- At the transition 
point complex symbolic dynamics can be expected, behavior which is not found 
in discrete Boolean networks. 

8 Discussion 

In this paper we formulated a computational interpretation of the dynamics 
of a switch-like ODE model of gene networks. We have shown that such an 
ODE can simulate memory bounded Turing machines. While in many cases 
such a model provides an adequate description, more realistic models assume 
sigmoidal response. In the neural network literature it is proven that sigmoidal 
networks with a sufficiently steep sigmoid can simulate the dynamics of switch- 
like networks I15I35I . This suggests that the results presented here carry over to 
sigmoidal networks as well. 

The property of adjacency was introduced in order to reduce a continuous 
gene network to a discrete one. We consider it as more than a trick, but rather as 
a strategy for fault tolerant programming of gene networks. In fact, the genetic 
toggle switch constructed in j0| has this property. However, when it comes to 
non-synthetic networks, this may not be the case - nature might have other ways 
of programming them, which are not so transparent. 
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Abstract. Quantum entanglement is behind various puzzling/mysteri- 
ous or paradoxical phenomena of quantum mechanics in general and of 
quantum information processing and communication (QIPC) in partic- 
ular. Or, using less strong terms, quantum entanglement leads to highly 
nonintuitive or even counterintuitive effects. Some of them have been 
known for long time. New are being discovered almost daily. At the 
same time quantum entanglement is considered to be a very powerful 
resource for QIPC and, together with quantum superposition, quantum 
parallelism and quantum measurement, to be the main resource of the 
(provable) exponentially larger power of QIPC with respect to the clas- 
sical information processing and communication. 

It is increasingly realized that quantum entanglement is at the heart 
of quantum physics and as such it may be of very broad importance 
for modern science and future technologies. It has also been to a large 
extent quantum entanglement that has led in quantum mechanics to 
asking fundamentally new questions and has brought a chance to see in 
a more profound way various features of the quantum world (and this way 
it can contribute to a new understanding of the foundations of quantum 
mechanics). There is therefore a large need to understand, analyze and 
utilize this resource. 

The aim of the paper is, on one side, to illustrate and analyze various puz- 
zling phenomena that are due to the quantum entanglement and related 
nonlocality of quantum mechanics, as well as computational and, espe- 
cially, communicational power of quantum entanglement. On the other 
side, the aim of the paper is to discuss main basic concepts, methods 
and results of the already very rich body of knowledge obtained recently 
at the attempts to understand, qualitatively and quantitatively, entan- 
glement, laws and limitations of its distribution as well as its properties, 
structures and potentials for applications. 

In order to demonstrate special power of quantum entanglement two 
puzzles “with science fiction overtones” are dealt with first. They look 
like exhibiting a telepathy. 



* Most of the paper has been written during the first author stay with the ERATO 
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1 Introduction 

Quantum physics has been well known for a long time for a variety of strange, 
puzzling, mysterious and even paradoxical phenomena that withstood numer- 
ous attempts to be dealt with by some of the brightest people in science. As 
a consequence, most of the (quantum) physicists got used to live with the un- 
derstanding that quantum physics is a superb theory all quantum physicists 
(should) know how to use, but actually nobody understands it fully. Puzzling, 
mysterious and paradoxical quantum phenomena actually did not seem to dis- 
turb most of the working (quantum) physicists too much. There was no conflict 
discovered between predictions of theory and experiments. Only philosophers 
of science, working on different interpretations, could come from time to time 
with some interpretations, with which usually only very few others fully agreed. 
The situation is quite well characterized by the observation that philosophers 
of science can hardly agree even on what the term “interpretation of quantum 
physics” should mean. 

Quantum theory performs its prediction role superbly, but lags behind con- 
cerning its other very basic task. Namely, to And explanations for all fundamental 
quantum phenomena; why they happen, how they happen, and how much they 
cost in terms of various physical resources. Quantum entanglement and quantum 
measurement are perhaps the best examples of such phenomena about which we 
still know far too little. 

Quantum entanglement is a subtle correlation between subsystems of a quan- 
tum system. A feature of the system that cannot be created by local operations 
that act on subsystems only, or by means of classical communications. 

Quantum entanglement is arguably the most specific of all inherently quan- 
tum phenomena. In spite of the fact that quantum entanglement and its non- 
locality implications were basically known for more than 65 years, quantum 
entanglement started to be of an intensive interest only after it has turned out 
that it can be useful for (quantum) teleportation, to assist in speeding up quan- 
tum information processing, to enhance capacity of quantum channels, to realize 
unconditionally secure cryptographic key generation and to make quantum com- 
munication in some cases much more efficient. Currently, it is believed that a 
better comprehension of quantum entanglement can have important impact also 
on our overall understanding of key quantum phenomena (and that it can lead 
to the view of the nature of quantum mechanics from a different angle as till 
now) . It is also expected that practical utilization of quantum entanglement can 
have a broad impact on the progress in making use of quantum resources for 
various areas of science and technology. 

A better understanding of quantum entanglement, of ways it is characterized, 
created, detected, stored, manipulated, transformed (from one form to another), 
transmitted (even teleported) and consumed (to do some useful work), as well 
as of various types and measures of entanglement, is theoretically perhaps the 
most basic task of the current QIP research. In short, quantitative and qualitative 
theory of the entanglement is much needed. 
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It is interesting also to observe that the whole field of quantum information 
theory, and especially quantum entanglement, is theory driven. It did not arise 
because of observations from some experiments or from very specific needs of 
some applications. 

Historically, quantum entanglement got a large attention in 1935 in connec- 
tion with the so called EPR-paper (Einstein et ah, 1935), and the related paper 
by Schrodinger (1935)J3 The EPR-paper concentrated on nonlocality quantum 
entanglement manifests and consequently on the question of the relation be- 
tween quantum theory concepts and physical reality. Schrodinger recognized 
profoundly non-classical correlation between the information which entangled 
states give us about the whole system and its subsystems. 

After 1935 entanglement has been seen as a troublemaker, and the problem 
was to get rid of it. Nowadays, it is seen as a precious resource and in the study 
of quantum entanglement much dominate those problems that are related to the 
need to understand and utilize entanglement as an information processing and 
communication resource. 



2 Puzzles 

Two entertaining puzzles discussed next are to illustrate power of entanglement 
to assist in creation of a sort of telepathy illusion - a pseudo telepathy. The first 
one is due to Hardy (1998). 



2.1 Stage Telepathy Puzzling Behaviour of Alice and Bob 

Alice and Bob are on a stage, very far from each other (so far that they cannot 
communicate), and they are simultaneously, but independently and randomly 
asked again and again either a “food question” or a “colour question”. 

— FOOD question: What is your favorite meal? 

ANSWER has to be either carrot or peas. 

— COLOUR question: What is your favorite colour? 

ANSWER has to be either green or red. 

The audience observes that their answers satisfy the following conditions: 

— If both are asked colour-questions then in about 9% of the cases they answer 

green. 

— If one of them is asked colour-question and answers green and the other is 
asked food-question then (s)he answers peas. 

— If both are asked food-questions they never both answer peas. 

^ The term entanglement is a free translation of the German term “Verschrankung” 
introduced by Schrodinger. 
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It is not difficult to show that within the classical physics there is no way 
that Alice and Bob could invent a strategy for their answers before they went to 
stage in such a way that the above mentioned rules would be fulfilled. However, 
there is a quantum solution, and actually quite an easy one. 

Let \p) and |c) be two arbitrary orthogonal states in H 2 and let 

|r) = a\p) + b\c), 

\g) = b\p)-a\c). 

Let Alice and Bob at the very beginning, before they went to stage, create two 
particles in the state 

1^) = ^(k)k) - a^\p)\p)), 

where is a normalization factor, and later each of them takes his/her particle 
with him/her to the stage. 

If any of them is asked the colour-question, then (s)he measures his/her 
particle with respect to the {|r), | 5 )}-basis and answers in accordance with the 
result of measurement. 

If any of them is asked the food-question (s)he measures his/her particle 
with respect to the {|p), |c)}-basis and responds in accordance with the result of 
measurement. 

It is a not difficult exercise to show that in this way Alice’s and Bob’s re- 
sponses follow the rules described above (9% comes from an optimization in one 
case). 



2.2 Guess My Number Puzzle 

A very nice technical result concerning the communication power of entangle- 
ment, obtained by van Dam, Hpyer and Tapp (1997), was attractively formulated 
into a puzzle, by Steane and van Dam (2000), as follows. 

Alice, Bob and Charles are located in isolated booths and are to guess, each 
time the moderator distributes secretly among them a number of apples, in 
total at most 4, potentially dividing some of the apples first into halves and 
distributing also halves, whether the total number of distributed apples was 
even or odd. If they guess correctly, they win. 

The guess is actually to be made by Alice and the only communication al- 
lowed between the contestants, Alice, Bob and Charles, is that each of Alice’s 
companions is allowed to send one bit to Alice (after seeing the portion of apples 
he got). 

Contestants are allowed, before going to isolated booths, to agree on a strat- 
egy and to take with them whatever they want. However, once they are in booths, 
they are not allowed to communicate in some other way than sending to Alice 
one bit at each round of game. 

The basic question is whether it is possible that contestants always win? That 
Alice is always able to determine whether the number of apples distributed is 
even or odd? 
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There is no way for Alice, using only classical communications, to guess in 
more than 75% cases the correct answer. However, there is the following quantum 
solution. 

— Before going to booths contestants create three particles in the GHZ state 
;^(|000) + |111)) and each of them takes with him/her one of the particles. 

— In the case Alice, Bob, and Charles get Xa, Xb or Xc apples, respectively, 
where Xa + Xb + Xc is an integer smaller than 5, then each of them applies 
to his/her particle the rotation 



|0)(0| + e“^ni)(l|- 



The resulting state is either 

i(|000) + |111)), if Xa + Xb + Xc is even, 
v2 

or 

^(|000) — |111)), if Xa + Xb + Xc is odd. 

V2 

— In order to enable Alice to determine which of the states they jointly own, 
each of the contestants applies on his/her qubit the Hadamard transforma- 
tion, getting together either the superposition of the basis states of even 
parity 

^( 1000 ) + | 110 ) + | 101 ) + 1011 )), 

or a superposition of states of odd parity 

^( 1001 ) + 1010 ) + | 100 ) + | 111 )). 

— Each of them then measures his/her qubit in the standard basis and Bob 
and Charles communicate the outcomes to Alice. They always win. 



3 Quantum Entanglement 

and Its Nonlocality Implications 



Quantum entanglement is a term used to denote inherently quantum and inher- 
ently global correlations between quantum subsystems of a composed quantum 
system. (The state of a composed classical system is composed of the states of 
its subsystems - this is not in general true for quantum systems) . The existence 
of nonlocal quantum phenomena, and the puzzles and mysteries this leads to, 
are its main implications. 
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3.1 Basics about Entanglement 

We first briefly overview the very basic concepts of quantum information theory 
related to quantum entanglement. For more concerning basic of QIPC see Gruska 
(1999, 2000). 

A pure state \(f>) of a bipartite quantum system A®B is said to be separable, 
or classically correlated, if |</>) = \(Pa) G \4>b), where \4>a) (|</'b)) is a pure state 
of the quantum system A {B). A pure state of A 0 B is called unseparable or 
entangled (or quantumly correlated), if it is not separable - if it cannot be 
decomposed into the tensor product of pure states of A and B. 

A mixed stat^EI (density matrix) p oi A® B is called separable if it can be 
written in the form p = PiPAp G PB,ii where all weights Pi are positive and 
PAi (PBi) is a density matrix of A (i?)|j A density matrix p is called unseparable, 
or entangled, if it is not separable. A density matrix is therefore entangled, if 
it is not a mixture of product states. 

An important example of entangled states in H2 ® H2 are Bell states: 

|^>+) = 4(|00) + 111)). I'^') = ^(|00) - 111)). 

= ^(| 01 )-| 10 )). 

The state is called singlet. It is often said that each of the Bell states 
contains one ebit of entanglement. 

There are several reasons why to take the above definition of entanglement 
of density matrices as the most general one, even if it does not seem to be very 
natural (and not that a density matrix p is entangled if it cannot be written as 
a tensor product of some density matrices of subsystems A and B, what would 
seem to be a natural generalization of the pure state case) . 

— A (mixed) state is not useful for quantum teleportation, see page^J if and 
only if it is separable in the above sense. 

— Neither a change of basis nor local operations can create an entangled state 
from unentangled ones. 

— For all natural measures of entanglement the amount of entanglement cannot 
be increased using local quantum operations and classical communications. 

Understanding, characterization, classification, quantification as well as ma- 
nipulation of multipartite entangled states is a very important and difficult prob- 
lem, perhaps the main current challenge in the quantum information theory, 
that needs to be explored in order to get more insights into the possibilities of 
distributed quantum computing and networks. The problem is also of special 

^ Throughout this paper we often use term “mixed state” also for density matrices. The 
rational behind is that two mixed states are undistinguishable if their corresponding 
density matrices are the same. 

® An equivalent definition is that p = 'Y^^Pi\4>){4'i\ ® where |</)i) and are 

pure states in two Hilbert spaces and pi > 0. 




Power, Puzzles and Properties of Entanglement 



31 



interest outside of QIP, for physics in general. For example, for understanding 
of dynamical systems - especially many-body systems with strong fluctuations 
and for improving information gathering facilities of experimental physics. 

Error correcting codes are one example of an important area where multipar- 
tite entanglement plays a crucial role and where quite a few insights have been 
obtained about it. 

A (mixed) state \p) of a m-partite quantum system represented by the Hilbert 
space i/P) (g) (g) . . . (g) is called separable if it can ve written in the 

form 

n 

P = ® Pi, 2 (g . . . © Pi,m, 

where each pi,j is a density matrix of and pt > 0. Otherwise, |(/)) is called 
unseparable or entangled. (Such a state cannot be created by m parties if 
the i-th party works with a quantum system corresponding to the Hilbert space 
and uses only local quantum operations and classical communications.) 

Entanglement of multipartite systems is theoretically a very complex issue 
and it has many degrees and forms, as discussed in Section El For example, if an 
TO-partite state is entangled it can still be separable if some of the parties get 
together and can perform global operations on the composition of their quantum 
subsystems. For example, if Ii, I 2 , ■■■ Ik is a partition of the set {1,2,..., m}, 
then we say that an m-partite state is I\ : I 2 '■■■■ Ik separable, if it is separable 
on the Hilbert space © ... © where is a tensor product 

of Hilbert spaces \ i G A}. 

For example, the so-called Smolin (2000) state 

1 4 

^ i=l 

where {\^t) | i G {1, 2, 3,4}} are all Bell states, is entangled in the Hilbert space 
A® B ® C ® D, where A, B, C, D are two dimensional Hilbert spaces, but it is 
{A,B} : {C,D}, {A,C} : {B,D} and {A,D} : {B,C} separable. 

Example. The so-called GHZ states are states of the form ^(|a5c)±|a6c)), 
where a,b,c G {0, 1} and at most one of a, b, c is one. They are neither {A, B} : 
{C} nor {A} : {B,C} nor (A, C} : {B} separable, if three parties are denoted 
as A, B and C. 



Special Mixed States. Several families of mixed states play important role in 
quantum information processing theory, especially the so called Werner states. 
They form a family of one parameter states 

Wf = /|<F-)(<F-| + l^(|iF+)(tf^+| + \<!>+){<P+\ + |<?-)(<?-|), 0 < / < 1 

A Werner state Wf (Wp) is entangled if and only if / > 5 . 
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3.2 Entanglement and Nonlocality 

Quantum theory implies, and (not fully perfect yet) experiments confirm, that a 
set of particles can be in an entangled state even if they are much space separated. 
As a consequence, a measurement on one of the particles may uniquely determine 
the result of measurement on much space separated particles. Einstein called this 
phenomenon “spooky action at a distance”, because measurement in one place 
seems to have an instantaneous effect at the other (very distant) place. In other 
words, measurements of entangled states may cause “ (weakly) nonlocal effects”. 
(The term “weakly” refers to the fact that such a nonlocality cannot be used to 
send messages.) 

It is this “quantum nonlocality” that entanglement exhibits, that belongs to 
the most specific and controversial issues of the quantum world. A set of particles 
in an entangled state can therefore be seen as a special quantum channel through 
which outcomes of one measurement can have immediate impact at much distant 
places. 

Nonlocality of the physical world is not a new issues. The existence of non- 
local phenomena has been assumed by Newton when he developed his theory of 
gravity. It has been later rejected by Einstein when he developed his theory of 
relativity. 

Various ways to see “nonlocal effects” are closely related to fundamental 
questions concerning the nature of physical reality and causation. 

The orthodox Copenhagen interpretation of quantum physics sees the basic 
reality behind quantum theory as “our knowledge” . This is well encapsulated in 
Heisenberg’s famous statement (see Stapp, 2000): 

The conception of the objective reality of the elementary particles has thus 
evaporated not into the cloud of some obscure new reality concept, but 
into the transparent clarity of a mathematics that represents no longer 
the behaviour of the particles but rather our knowledge of this behaviour. 

Einstein can be seen as the main representative of a quite different view. 
Namely, that it is not acceptable that a major physical theory would not refer to 
the physical reality. Well known are also positions from the EPR-paper (Einstein 
et ah, 1935), that a proper physical theory should have the following features: 

Completeness: “In a complete theory there should be an element correspond- 
ing to each element of reality.” 

Locality: “The real factual situation of system A is independent of what is 
done with system B, which is spatially separated from the former.” 
Reality: “If, without in any way disturbing a system, we can predict with cer- 
tainty (i.e., with probability equal to unity) the value of a physical quantity, 
then there exists an element of physical reality corresponding to this physical 
quantity.” 

In particular, they suggested that quantum states do not provide a complete 
description of the physical reality, and therefore there have to exist additional, 
the so called “hidden” variables, whose objective values would unambiguously 
determine the result of any quantum experiment. 
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3.3 Bell Theorem, Its Proofs and Experimental Verification 

Einstein, and many others, believed that such mysteries as entanglement seems 
to imply are due to the fact that the current quantum theory is incomplete, and 
he believed that by using some “hidden variables” one could develop a complete 
theory without non-local influences, where one could believe in the existence of 
an objective reality for quantum theory concepts. 

There have been several attempts to develop a hidden variable interpretation 
of quantum theory. Arguably the most successful has been one due to Bohm 
(1952). However, a simple, but ingenious, Gedanken experiment suggested by 
Bell (1964), led to the conviction, for many, that all attempts to develop a 
hidden variable theory of quantum physics have to fail. 

Bell showed that ideas suggested in the EPR paper, about locality, reality 
and completeness, are incompatible with some quantum mechanical predictions 
concerning entangled particles. He showed, for a simple Gedanken experiment, 
that the average values of certain random variables related to the experiment 
have to satisfy certain inequalities provided quantum physics satisfies a hidden 
variable theory and that these average values do not satisfy the above mentioned 
inequalities provided the current quantum theory holds. 

All Gedanken experiments and inequalities of the Bell type allow to test 
experimentally quantum nonlocality — such experiments are called Bell exper- 
iments. The first quite convincing test of Bell inequalities was performed by 
Aspect in 1982. Since then a variety of other experiments have been performed 
and they brought quite impressive confirmation of the current quantum theory 
and of its nonlocality. However, none of them have been without loopholes. The 
loopholes are mainly of two types. Either all particles on which measurements 
are performed are not too far or quality of detectors is not fully sufficient (see 
Percival, 1999). The views differ concerning our possibilities to perform a Bell 
experiment without loopholes (in other words to close all loopholes) in the near 
future. 

All experiments performed so far make use of untestable supplementary as- 
sumptions. For example the “fair sampling assumption”. Its basic idea is that 
“the detectors sample the ensemble in a fair way so that those events in which 
both particles are detected are representable of the whole ensemble” . Or the as- 
sumption “that detector efficiency is not dependent on local “hidden variables””. 

Without the above assumptions the detectors used in the experiments need 
to have some efficiency (estimated to be at least 67%) what is far more than 
what has been achieved so far. 

It is also far from being completely clear whether we can performed Bell 
experiments with ever increasing precision and whether we do not discover some 
laws of Nature that prevents us from doing that. 

In order to perform Bell experiments it is needed to generate pairs of en- 
tangled particles. The so called Mermin experiments and inequalities allow to 
test nonlocality using three and more particle entanglement. In these cases re- 
quirements on quality of detectors are smaller and quantum nonlocality can be 
determined more clearly. However, creation of entangled particles is a non-trivial 
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problem and its complexity grows rapidly with the number of particles that have 
to be mutually entangled. 

One can say that Bell experiments, based on Bell states, are able to show 
that statistical predictions based on quantum hidden variable theories differ 
from those provided by quantum mechanics. Later, Greenberger et al. (1989) 
have demonstrated that Gedanken experiments based on GHZ states can show 
that also definite predictions of any hidden variable theory differ from those 
provided by quantum mechanics. 

3.4 Creation of Entangled States 

Greation of entangled states in labs is a key and non-trivial problem for a variety 
of experiments concerning entanglement and of QIPG in general. 

There are various ways how to create entangled states. Let us list some of 
the main ones. 

Experimental generation using special physical phenomena. 

Entangled photons, atoms, ions and nuclear spins have been experimentally 
produced. Entangled atoms were produced using cavity quantum electrody- 
namics, entangled ions were produced in electromagnetic Pauli traps, con- 
trolled entanglement between nuclear spins within a single molecule was 
achieved using NMR technology. Several types of entanglement were pro- 
duced: polarization entanglement, time-energy entanglement, position and 
momentum entanglement, .... 

There are four basic methods how to create experimentally an entangled 
state: (a) a physical source is just producing entangled states; (b) A source 
may produce independent quantum states that are component of an entan- 
gled superposition and then an apparatus has to be used to shift them in 
time so resulting entangled state is obtained; (c) to project a state (entangled 
or not) into an entangled one — see entanglement swapping bellow. 

Sources of entanglement in condense-matter physics are now being inten- 
sively explored. 

Most of known mechanisms for generating entangled particles depend on the 
specific nature of the systems involved. However, also more general schemes 
for entangling particles start to emerge, see for example Bose and Home 
(2001). What is much needed are sources for which there is a high degree of 
control over the degree of entanglement and purity of states. 

To generate perfect pure entangled states, especially in the case of more 
particles is a very demanding task. Sometimes therefore the so called pseudo- 
pure states are used as sufficient. 

Application of unitary operations to separable states. For example, all 
Bell states can be obtained if XOR operation is applied to some separable 
states 10) |a), where a £ {0, 1} and |0) is a suitable one qubit state. 

Actually, some laboratory techniques to design entangled two-qubit states 
are quite similar. Some physical process is applied to two separable qubits to 
make them interact. On a general level the potential of this way of designing 
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two-qubit entangled states have been investigated by Kraus and Cirac (2000). 
They asked and answered the following question. Given a two-qubit unitary 
transformation U, what is the maximum of entanglement it can be obtained 
if U is applied to two qubits. In other words, what is the maximum of 
entanglement U can create, and for which qubit states. 

Entanglement swapping — teleportation of entanglement. It used to 
be thought that if some distant particles are in an entangled state, then some 
time ago they had to interact and only later they had been separated. How- 
ever, it is now clear that no direct interaction is necessary in order to produce 
entanglement between distant quantum systems. For example, entanglement 
can be teleported. A so-called entanglement swapping protocol has been 
proposed by Zukovski et al. (1993) — and experimentally verified by Pan et 
al. (1998). 

The basic idea is simple. If we have two pairs of particles entangled in a Bell 
state (Pi with P 2 and P 3 with P 4 ), Figure QJi, and a Bell measurement on 
particles P 2 and P 3 is performed, then we get two new entangled pairs: Pi 
with P 4 and P 2 with P 3 , both in a Bell state. 




Fig. 1. Entanglement swapping and its generalizations 



Technically, for any a, b,c,d G {0, 1}, if a Bell operator is applied to the state 
~{\abcd) + \dbcd) + \abcd) + \dbcd)), 

to its two middle qubits, then the resulting state is such that the particles 
Pi and P 4 , as well as P 2 with P 3 , are in a Bell state. (Observe that entangle- 
ment swapping actually allows superluminal establishment of entanglement 
between two distant parties.) 
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It is easy to see that by creating several intermediate stations for producing 
Bell states and for performing Bell measurements one can significantly speed 
with entanglement swapping the task to supply very distant users with Bell 
states. 

An elegant generalization of entanglement swapping to an arbitrary number 
q of quantum systems Sj composed from an arbitrary number rrij of qudits 
has been worked out by Bouda and Buzek (2000). Each of the systems Sj is 
supposed to be in a maximally entangled state of rrij qudits, while different 
systems are not correlated. They showed that when a set 
tides, with aj particles from the system Sj, are subjected to a generalized 
Bell measurement, the resulting set of particles collapses into a maximally 
entangled state. They also showed how to generalize results for the case of 
continuous variables. 



Peres’ Paradox. A curious modification of the entanglement swapping was 
developed by Peres (2000) using a simple protocol at which entanglement 
is produced a posteriori, after entangled particles have been measured and 
may no longer exists (!). 



3.5 Local Quantum Operations and Classical Communication 

In quantum information processing literature one often considers manipulations 
with multipartite, especially bipartite, systems at which only local operations 
and classical communications are allowed (by particular parties involved, each 
one in the possession of one of the subsystems). For this cases the following 
acronyms start to be used: 

LOCC — local unitary operations and (two-way) classical communication. 
LQCC — local cmantum operations (any, including superoperators and POVM 
measurementtu) and (two-way) classical communications. 

SLQCC — stochastic local quantum operations and classical communication 
(two states are SLQCC interconvertible, if each of them can be obtained 
from the other one using LQCC with non-zero probability). 

EALQCC — entanglement-assisted local quantum operations and classical 
communication (Jonathan and Plenio, 1999) — communicating parties are 
allowed, in addition to features offered by LQCC, first to “borrow” some en- 
tanglement, then to use it for local operations and classical communications, 
but without consuming it. 



3.6 Nonlocality without Entanglement 

Nonlocality is often identified with quantum entanglement. On one side, each 
pure entangled state violates a Bell type inequality and exhibits a nonlocality. 
However, a form of “nonlocality without entanglement” was demonstrated by 

^ Word of caution: often notation LOCC is used instead of LQCC 
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Bennett et al. (1998). They created an orthogonal set of product states of two 
qutrits (states in H^) that cannot be reliably distinguished by LQCC. (They 
have shown that there is a gap between mutual information obtained by any joint 
measurement on these states and a measurement at which only local operations 
are performed.) 

4 Detection of Entanglement 

It is fairly easy to determine whether a given pure bipartite state is entangled. On 
the other hand, to determine whether a given density matrix is entangled seems 
to be a very nontrivial problem. Non-triviality of the problem is due to the fact 
that in the density matrices quantum correlations are weak and a manifestation 
of quantum entanglement can be very subtle (see Horodeckis, 2001). 

4.1 Separability Criteria 

By a separability criterion we call a condition for density matrices that is 
satisfied by all separable density matrices and it is not satisfied by all density 
matrices. More of entangled density matrices do not satisfy a separability crite- 
rion, stronger this criterion is. For example, all Bell inequalities (that are always 
satisfied by separable states) represent an interesting set of (not very strong) 
separability criteria|§For each pure entangled state there is therefore a set of 
measurements that allows to detect its entanglement. 

A simple, but powerful, separability criterion, the so called positive partial 
transposition criterion (PPT-criterion) is due to Peres (1996). It is based on 
the observation that if a density matrix p of a composed (bipartite) quantum 
system A (E) B is separable, then its partial transpose p has to have positive 
eigenvalues. 

A partial transpose p of p is obtained from p by making a transposition in 
p with respect to only one of the subsystems. Formally, if both A and B are 
n-dimensional Hilbert space and a density matrix p of A 0 5 is decomposed 
into n X n submatrices, then the transposition of p, is obtained just by making 
transposition of submatrices as follows: 



( All • 


■ \ 




/AL- 


• Aj’n 


V^nl • 


• -^nn / 


p = 




■ Ann 



® The so called Bell-CHSH inequalities are inequalities of the form Tr{pB) < 2 , where 
B are matrices of the form B = riug) (f2 -|-r3)o'-|-f4cr(g) (f2 — fsjcr, where f’s are unit 
real vectors and if r = (ri,r2,r3), then fa = ria^ + J"2<Tj, + r 3a where ax,ay,a^ 
are Pauli matrices. 
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For example, the density matrix p of the Bell state and its transpose p 
have the form 





/O 0 0 0 \ 




/O 0 0 1 \ 
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0 110 
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0 10 0 


^=2 


0 110 


’ ^=2 


0 0 10 




1^0 0 0 0 ^ 




1^1 0 0 0 / 



and p has —5 as one of the eigenvalues. 

A state p is called PPT-state (NPT-state) if its partial transpose is pos- 
itive (is not positive). Peres’ criterion was shown by Horodeckis (1998c) to be 
a necessary and sufficient condition for density matrices of bipartite systems of 
dimension 2x2 and 2 x 3 to be entangled. 

4.2 Entanglement Witnesses 

A very interesting and useful approach to the detection of entanglement is based 
on the concept of entanglement witness introduced by Horodeckis (1996) and 
coined by Terhal. 

An entanglement witness for density matrices of a Hilbert space H of dimen- 
sion n is any matrix (observable) W of dimension n such that 

~ Tr{W p) > 0 if p G i? is a separable density matrix. 

~ There is an entangled density matrix p G H such that Tr{W p) < 0. 

In other words an entanglement witness can detect entanglement in density 
matrices. 

Swapping (or flipping) operator V defined by V{\(j)) \ip)) = \ip) is an 

example of an entanglement witness. 

Since an entanglement witness can fail to discover entanglement a natural 
question is to find as best as possible entanglement witnesses. Namely, such that 
do not fail at all or fail very rarely to detect entanglement. 

An entanglement witness W is called optimal, see Lewenstein et al. (2000), 
if there is no other entanglement witness that detects the same entanglement 
as W and more. Lewenstein et al. (2000) also found a way to design optimal 
witnesses. This allows well experimentally detect quantum entanglement. For 
example, the optimal entanglement witness for the state cos 6 * 1 00) -I- sin 0| 11) has 
the form 

W = ^(|01)(01| + |10)(10| - |00)(11| - |11)(00|). 

5 Power of Entanglement 

The role of entanglement as of an important computational and communicational 
resource is often much emphasized. There have been many strong statements 
about the key role of entanglement for having quantum information processing 
to be more powerful than classical one. However, much more research seems to 
be needed to get a really full (or at lest sufficient) understanding of the role of 
entanglement for communication and, especially, for computation. 
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5.1 Power of Entanglement for Computation 

Gottesman and Chuang (1999) have shown that entanglement is a computa- 
tional primitive because it is possible to realize any quantum computation by 
starting with some GHZ states and then performing one qubit operations and 
Bell measurements. 

There are also other results showing that entanglement is a sort of substrate 
on which quantum computation can be performed. For example, Rausschendorf 
and Briegel (2000) have shown that universal quantum computation is possible 
by starting with a special multipartite entangled states and then performing 
only single qubit measurements, to model a circuit computation. 

In addition, Shor’s algorithms make heavy use of entanglement, and no way 
seems to be known how to perform efficient factorization, or to solve efficiently 
the hidden subgroup problem, without entanglement. 

In spite of that it is far from clear that entanglement is the key resource 
behind the efficiency of fast quantum algorithms. Quantum superposition, par- 
allelism and, especially, quantum measurement are other, perhaps even more 
important, quantum resources for computation. 

It is also interesting to notice that quantum entanglement does not play 
a role in the current theory of quantum automata (quantum finite automata, 
quantum Turing machines, quantum cellular automata). It does not seem also 
that entanglement play a role at the definition of such quantum complexity 
classes as BQP. 

To illuminate the role the entanglement has for efficient quantum computa- 
tion is one of the key tasks of the current QIPG research. Unfortunately, it does 
not seem to be a very visible way how to do it. 

5.2 Power of Entanglement for Communication 
— Quantum Communication Mysteries 

While in quantum computation we merely believe that quantum mechanics al- 
lows exponential speed-up for some computational tasks, in quantum communi- 
cation we can prove that quantum tools can bring even exponential savings. 

Let us start our discussion of quantum communication by pointing out that 
there are actually good reasons to believe that quantum communication cannot 
be of special use. Indeed, Holevo theorem says that no more than n bits of 
classical information can be communicated by transmitting n qubits - unless two 
communicating parties are entangled and in such a case at most twice as many 
classical bits can be communicated, using quantum dense coding, see Section El 

In addition, entanglement itself cannot be used to signal information — oth- 
erwise faster than light communication would be possible. Our intuition there- 
fore says that there should be negative answers to the following fundamental 
questions (Brassard, 2001): 

— Gan entangled parties make better use of classical communication channels 
than non-entangled parties? 
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— Can entangled parties benefit from their entanglement even if they are not 
allowed any form of direct (classical or quantum) communication? 

Surprisingly, as we are going to discuss now, and as the puzzles already indicated, 
our intuition in these cases has turned out to be very wrong. 



5.3 Entanglement Enhanced Quantum Communication Complexity 

In the so called entanglement enhanced quantum communication model, only 
classical bits are communicated, but communication is facilitated by an a priori 
distribution of entangled qubits among the communicating parties. Communica- 
tion protocols are defined in the quantum case in a similar way as in the classical 
case. 

An exponential gap between the exact classical and quantum entanglement 
enhanced communication complexity has been shown by Buhrman et al. (1998), 
for the exact solution of the following promise problem. 

Let f,g : {0, 1}" — ?> {0, 1}, and let A{f,g) be the Hamming distance between 
/ and g which equals the Hamming distance of 2”-bit strings /(0)/(l) . . . /(2" — 
1) and g{0)g{l)...g{2'^ — 1). Let EQ’ be the partial function defined by 



EQ'(/,5) 



l,ifZ\(/,5) = 0, 
0,ifZ\(/,5) = 2"-i 



and undefined for other arguments. (This can be seen also as the requirement to 
solve the above equality problem under the promise that A{f,g) G {0,2”“^}.) 

Raz (1999) presents another example of a communication promise problem, to 
compute a partial function, for which there is a quantum communication protocol 
with exponentially smaller communication complexity than any probabilistic 
communication protocol exhibits. 

Open problem. Can we show an exponential gap between the classical com- 
munication complexity and the entanglement enhanced quantum communication 
complexity also for total functions in a non-promise setting? 



5.4 Spooky Quantum Communication Complexity 

The second question formulated on page 001 has also positive answer. It is the 
question whether entanglement can be used instead of communication. In other 
words, whether quantum pseudo telepathy is possible and what we can say about 
it quantitatively. 

One way to approach the problem (Brassard, 2001), could seem to be to ask 
whether we can have a function f : X x Y ^ Z such that it is not possible to 
determine f{x, y) from the knowledge of either x or y, but two entangled parties 
can compute f{x,y), if one is given an x and other a y. However, this cannot 
be the case, because this would allow faster than light communication. However, 
there is a way to find a positive answer to the above problem when computation 
of relations and not of functions is considered. Spooky communication complexity, 
Brassard et al. (1999), has been introduced as follows. 
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Let X, Y , A and B be sets and R C X x Y x Ax B.la the initiation phase, 
two later physically separated and entangled parties, Alice and Bob, are allowed 
to establish a strategy. After they are separated one of them is given a,n x G X 
and second & y GY and their goal is to produce a G A and b G B, each of them 
one, such that (x, y, a, b) G R. It is said that spooky communication or pseudo- 
telepathy takes place, if the above task cannot be done for sure classically, but 
it can be done once these two parties share entanglement. Spooky communi- 
cation complexity of the task is then defined as the amount of entanglement 
required to perform the task. 

Brassard et al. (1999) has shown, for the relation R defined bellow, that the 
number of bits to compute the relation classically is exponentially larger than 
the number of entangled pairs shared by the parties at spooky communication 
(and therefore that spooky communication can be exponentially more effective). 

Let k be an integer, n = 2^, X = Y = {0, 1}", A = B = {0, 1}*^ and 

{ X = y and a = b; 

A{x, y) = f and a b; , 

A{x,y) ^ {0,n/2}. 

where again Z\(x, y') denotes the Hamming distance. (In other words, if Alice and 
Bob are promised that their inputs are either the same or differ in half of bits 
and they have to produce the same outputs only if their inputs are the same.) 

It has been shown that there is a constant c such that the above relation 
cannot be established classically with fewer than c2A bits provided k is sufficiently 
large, but k entangled pairs are sufficient for a spooky communication solution. 

6 Quantum Teleportation and Superdense Coding 

Perhaps the most interesting and inspiring demonstration of entanglement, at 
least so far, as a communication resource is quantum teleportation, due to Ben- 
nett et al. (1993), that allows transmission of an unknown quantum state to a 
distant place in spite of the impossibility of measuring or broadcasting states to 
be transmitted. Quantum teleportation is arguably one of the main achievements 
of quantum information theory]^ 

Experimental verification of quantum teleportation is still one of the main 
experimental challenges in quantum information processing. The first attempt 
to perform such a teleportation was successfully, to a certain degree, done by 

® The so called No-teleportation theorem, one of the fundamental laws of quantum 
mechanics, says that (classical) teleportation is impossible. This means that there is 
no way to use classical channel to transmit faithfully quantum information. In more 
technical words there is no possibility to measure a quantum state in such a way that 
the measuring results would be sufficient faithful to reconstruct the state. (It is easy 
to see that classical teleportation would imply that quantum cloning is possible and 
this would imply that joint measurements are always possible and finally this would 
imply, through so called Bell telephone, see Werner (2000) for a lucid explanation, 
that superluminal communication would be possible. 
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Zeilinger’s group in Innsbruck (see Bouwmeester et al. (1997)). Currently the 
most perfect teleportation experiment seems to be reported by Kim et al. (2000), 
as the first experiment that satisfies the following basic requirements: (a) arbi- 
trary state can be teleported; (b) there is output state that is “instantaneous 
copy” of the to-be-teleported state; (c) all four Bell states can be distinguished 
in the Bell measurement. 



6.1 Basic Teleportation Schemes 

The most basic task is teleportation of pure quantum states, especially qubits. 



Teleportation of Qubits. The basic teleportation protocol runs as follows. 
Alice and Bob share two particles in a Bell state (forming the so called 
teleportation channel <?), and Alice posses a particle C in an unknown state 
a|0) -|- /3|1). If Alice performs the Bell measurement on her particle and on the 
particle C, then Bob’s particle gets with the same probability into one of the four 
states a|0) -I- /3|1), /3|0) -I- a|l), a|0) — /3|1), — /3|0) -I- a|l), and the classical result 
of the measurement, obtained by Alice, tells uniquely which of these four cases 
has materialized. Once Alice sends this classical information to Bob, he knows 
which of the Pauli matrices to apply to his particle to get it into the (unknown) 
state a|0) -I- j3\l). 

Remark. The following facts concerning quantum teleportation are worth 
to observe: (a) There is no need to know the state in order to be able to tele- 
port it; (b) All quantum operations performed during teleportation are local; 
(c) All operations performed during teleportation are independent of the state 
being teleported; (d) Quantum teleportation can be seen as a “real teleporta- 
tion” at which an object is at first dematerialized, then transfered and finally 
reassembled, at a different place — Bob’s particle at the end of the teleportation 
is undistinguishable from Alice’s particle at the beginning of teleportation; (e) 
In spite of the fact that maximally entangled state that Alice and Bob share 
is publicly known, teleportation represents an absolutely secure way of trans- 
ferring quantum information at which no eavesdropping is possible — because 
no particle is transfered; (f) Observe that after the measurement Bob’s particle 
is in the totally mixed state and therefore a potential eavesdropper cannot 
learn anything about state being teleported this way. 

6.2 Generalized Teleportation Schemes 

There are, as one would expect, several generalization of the basic teleportation 
scheme to explore. One of them is to consider two party teleportation, but of 
more dimensional states than qubits. The other natural possibility is to consider 
multipartite teleportation 

A general approach to two-party quantum teleportation has been worked out 
by Accardi and Ohya (1999) and especially by Albeverio and Fei (2000). The 
problem that has been solved is the following one: Given a state of a Hilbert 
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space Hi, and an entangled state IV' 2 , 3 ) of the tensor product of Hilbert spaces 
with all three Hilbert spaces having finite dimension and with dimension 
of Hi being not smaller than that of H^, conditions are established for a unitary 
transformation U on Hi®H 2 such that if on the state U ®I{\4>i)\'tp2,z)) a proper 
measurement is performed than the resulting state of is such that the 
result of the above measurement uniquely determines a unitary transformation, 
not dependent on \4>i), which, when applied on {ips), yields the state 

Concerning multipartite teleportation, Gorbachev and Trubilko (1999) at 
first showed a simultaneous teleportation of an unknown state a|00) + /3|11) 
through a GHZ-channel to two parties. However, Marinatto and Weber (2000) 
showed that it is not possible to perform teleportation of any two-qubit state. 
Gorbachev et al. (2000) then showed how to teleport any entangled two qubit 
state using GHZ channel and three bits of classical information. 

6.3 Remote State Preparation (RSP) 

Let us now investigate the case that the party in need of teleporting a state has 
a full classical knowledge of the state. Is it reasonable to expect that in such a 
case either less ebits or less bits are needed for teleportation? At least in some 
cases? 

The answer is, quite surprisingly, positive. As shown by Lo (1999) and Ben- 
nett et al. (2001), in such a case there is a non-trivial tradeoff between the ebits 
and bits. Bennett et al. (2001) showed, for example, that asymptotically classical 
communication cost of RSP, in the presence of a large amount of preshared en- 
tanglement, can be reduced to one bit for qubit provided four ebits are used (and 
becomes even less when transmitting part of a known entangled state). During 
RSP most of the preshared entanglement is not consumed and it can be recov- 
ered, using a backward communication, from Bob to Alice. On the other hand, 
if a large number of bits is used, say n, the number of ebits needed grows like 
n2“". Protocols developed by Bennett et al. (2001) work even for finite number 
of (identical) states Alice wants to prepare for Bob. A certain catch is that no 
protocol is known that creates states always faithfully and uses in all cases less 
classical communication, per qubit, than quantum teleportation. 

6.4 Teleportation as a Computation Primitive 

Teleportation illustrates well the known fact that quantum measurements can 
be seen as an important quantum computational primitives and that in some 
cases quantum measurement does not have to destroy quantum information irre- 
versibly. Just the opposite, it can help to transfer quantum information from one 
place to another. In addition, as shown by Gottesman and Ghuang (1999), gener- 
alized quantum teleportation can be seen as a universal quantum computational 
primitive. 
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6.5 Teleportation and Nonlocality 

It seems intuitively clear that quantum teleportation is due to nonlocality that 
(measurements of) entangled states exhibit. However, quite surprisingly, Hardy 
(1999) has shown that concepts of teleportation and nonlocality are conceptually 
independent. Namely, he constructed a toy model which is local and within which 
no-cloning theorem holds, but still teleportation holds. 

6.6 Dense Coding 

Teleportation uses two bits to teleport one qubit. Dense coding (Bennett et al. 
(1993)), is dual to teleportation. It uses one qubit to send two bits. 

Let Alice and Bob share the Bell state Depending on two bits to be 

send Alice performs on her particle one of the Pauli operators and sends her 
particle to Bob. Bob disentangles particles using XOR operator and measures 
his particle in the standard basis and Alice’s particle in dual basis. The result 
of the measurement uniquely determines two bits Alice intended to transmit to 
Bob. 

6.7 All Teleportation and Dense Coding Schemes 

Werner (2000) brought an important new insight on possible teleportation and 
dense coding protocols. He showed not only that there is one-to-one correspon- 
dence between (optimal) teleportation and dense coding schemes (realized with 
minimal resources (with respect to Hilbert space dimension and classical in- 
formation)), but that there is also a one-to-one correspondence between such 
schemes and schemes for the three other tasks: (a) construction of orthonormal 
bases of maximally entangled states; (b) construction of orthonormal basis of 
unitary operators with respect to Hilbert-Schmidt scalar product; (c) depolariz- 
ing operations whose Kraus operators can be chosen to be unitary. 

On one side it is not surprising that there is a close relation between telepor- 
tation and dense coding protocols because actually they often perform basically 
the same operations only in the opposite order. One-to-one relations to other 
problems is more surprising. However, it helps to see complexity of the problem 
to design teleportation schemes because, for example, the problem of construct- 
ing orthonormal basis of unitary operators is shown to involve such not well 
understood tools as Hadamard matrices and Latin Squares. 

A natural generalization is to consider dense coding over n-partite channels. 
As shown by Gorbachev et al. (2000), in such a case the improvement of the 
classical capacity of quantum channel is only Moreover, in this multipartite 
channel setting there are no such deep relations between teleporting and dense 
coding as discussed above. 

6.8 Teleportation, Mixedeness and Dense Coding 

A very natural question is how good are mixed states for teleportation and 
dense coding. One reason to ask this is that pure entangled states are harder to 
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produce than mixed states. Second reason is that mixed states can have more 
entanglement than pure separated states, if von Neumann entropy is taken as a 
good measure of mixedeness. 

Horodeckis (1999) have shown that in the case of bipartite systems of dimen- 
sion n the singlet fraction F{p) < - implies that teleportation with p cannot be 
done with better than classical fidelity. Bose and Vedral (1999) improved that 
condition to S{p) > lgn-|-(l — ^) Ig(n-l-l). This means that for quantum systems 
of very large dimension, even an entropy close to maximal is not sufficient to 
ensure failure of teleportation. 

Concerning dense coding, an entangled mixed state p fails to be useful for 
dense coding when S{p) > Ign. Since the threshold Ign is smaller than that for 
teleporation, Ig n-k (1 — ^) lg(n-|- 1), we can conclude that teleportation is “more 
robust” with respect to the external noise than dense coding. 

For a general case that two parties A and B share mixed state p over n x n- 
space the problem to determine capacity C{p) of super dense coding channel 
of p is almost solved. Hiroshi (2000) established the bound Er{p) < C{p) < 
Ign -k Er{p). Bowen (2001) conjectures that C{p) = S{pb) — S{p) -k lg3 and 
shows this for n < 3. 

7 Entanglement Distillation 

— Concentration and Pnrification 

Various applications (of entanglement) require that communicating or cooper- 
ating parties share maximally entangled pure states. For example, they can en- 
sure perfect security of communication or rapid performance of certain forms of 
distributed computing. On the other hand, communicating and/or cooperating 
distant parties have to their disposal mostly not maximally entangled states and 
mainly, not pure, but mixed states. The question is then how to get out of them, 
using only LQCC, maximally entangled pure states. In this connection of large 
importance are two tasks. 

— Entanglement concentration. How to create, using only LQCC maximally 
entangled pure states from not maximally entangled ones. 

— Entanglement purification. How to distill pure maximally entangled states 
out of mixed entangled states. 

Remark. Often term distillation is used to mean both concentration or pu- 
rification. 



7.1 Entanglement Concentration 

Schmidt projection method (Bennett, 1996a) for entanglement concentration is 
asymptotically optimal, but requires to have (infinitely) many copies of the state. 

Bennett et al. (1995) developed a method to make entanglement concen- 
tration starting with the state (a|00) + /3|11))®” and producing the expected 
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number of maximally entangled state equal to iJ(|ap) — 0(lg(n)). A simplifica- 
tion of the above method, with the same asymptotic performance, but also with 
detailed concentration circuits is due to Kaye and Mosca (2001). 

A different and practically important task is to perform optimal entanglement 
concentration for a single pure bipartite state. 

Jonathan and Plenio (1999) found a way, using Nielsen’s (1998) theorem on 
transformation of bipartite states, to solve the following entanglement concen- 
tration problem. 

Alice and Bob share a pure bipartite state \ip) with Schmidt decomposition 

n 

IV’) = X! v^iV’*)iV’*)> 

i=l 

whose entanglement they want to concentrate using LQCC. Consider the set 
of local transformations that generate the maximally entangled state \0j) = 
IV’i)IV’j)j ^ ^ j ^ n with probability Pj. Since von Neumann reduced 
entropy of \0j) is Inj, for the average amount of distilled entanglement it holds 
(E) = The following theorem shows how this quantity can be 

maximized. 

Theorem The optimal entanglement concentration procedure for a bipartite 
state I'i/’) with ai > a 2 >■■■> ctn > 0 is one that produces the maximally 
entangled state \4>j) with probability Pj = j{aj — aj+i)- The corresponding 
optimal average entanglement is then 

n 

{E)max = - aj+i)jlnj. 



7.2 Entanglement Purification 

The goal of entanglement purification techniques is to distill, from a set of m 
identical mixed two-qubit states, a set ofn < m maximally entangled pure states, 
say singlets, with the ratio — as large as possible. 



Global Purification Techniques. Conceptually simple purification technique 
was developed by Bennett et al. (1996). To describe this technique it is useful 
to consider two quantities on mixed states p in a Hilbert space H : 

— Singlet fraction F{p) = {E~\p\E~); 

— Fully entangled fraction E{p) = m.axp{\'tp\p\tf;), where maximum is taken 
through all maximally entangled states. 

Let Alice and Bob share n mixed states p with F{p) > They iterate the 
following steps: 
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1. They choose two pairs of particles, each pair in the state p, and apply to 
each of them unitary transformation U ®U* , where U is randomly chosen. 
(This way states p are transformed to new identical states with the same 
singlet fraction.) 

2. Each party performs XOR operation on their particles. 

3. The pair of the target particles is measured in the standard basis and it 
is discarded. If the results agree the source part is kept and has a greater 
singlet fraction as p. 

This method allows to get mixed states with arbitrarily large singlet frac- 
tions, but its asymptotic rate is zero and it requires that F{p) > However, 
a modification of the method, developed by Horodeckis (1997a), works for all 
two-qubit mixed states. 

There is one crucial problem with the above (global) purification protocols. 
They rely heavily on carrying out collective measurements on a large number of 
shared states, assume infinite supply of shared states and work only asymptoti- 
cally. 



Local Purification Protocols. A practically important and theoretically in- 
teresting problem is then to determine how much purification, and when, can be 
done if we have only one mixed state to purify and only local quantum operations 
can be used. In other words, what is the power of local purification schemes? 

It is known that some mixed states can be purified using local purification 
techniques. However, as shown by Kent (1998), no local purification protocol can 
produce maximally entangled pure state from any entangled mixed state. (For 
example, some Werner states cannot be purified locally.) In addition, Kent et 
al. (1999) designed a purification (and concentration) protocol for mixed states 
of two qubits that is optimal in the sense that protocol produces, with non-zero 
probability, a state of maximal possible entanglement of formation. 



Practical Methods of Purification. Since distribution of entangled states 
between distant parties seems to be of large importance for various quantum 
communication protocols, such as quantum cryptography, quantum teleporta- 
tion and so on, and entanglement purification is the main way to distill highly 
entangled states from less entangled states or mixed states, it is of large interest 
to find practical entanglement purification techniques. 

The techniques discussed above are based on the use of XOR or similar 
quantum gates — on tools that are far from easy to implement perfectly enough 
to be really useful for purification. An important challenge of QIPC is therefore 
to look for other, more practical, purification techniques. 

One such a technique has been developed by Pan et al. (2000). This tech- 
niques requires to use only simple linear optical elements that have very high 
precision. It should lead to a procedure that is within the reach of the current 
technology. 
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8 Entanglement Measures 

Once it has been realized that quantum entanglement is a precious computation 
and communication resource, it has become important to quantify entanglement 
in order to determine how effectively we can process and communicate informa- 
tion. 

It has turned out quite easy to quantify entanglement of pure states. On the 
other side, it has turned out more difficult to quantify entanglement of bipartite 
mixed states in a physically meaningful way, but quite a progress has been 
achieved. On the other hand, too little is known about the key problem in this 
area — how to quantify entanglement of multipartite mixed states. 



8.1 Axioms 

An entanglement measure is a real- valued function defined on mixed states (den- 
sity operators) of bipartite or multipartite quantum systems satisfying some 
physically (and mathematically) motivated conditions. 

The problem of finding minimal general conditions, usually termed “axioms” 
in physics literature, each good measures of entanglement should satisfy, has 
been dealt with for quite a while. There are several, more or less compatible, 
proposals for such axioms. For example, in Bennett et al. (1996a): (a) E{p) = 0 iff 
p is separable; (b) invariance under local unitary operations; (c) the expectation 
of entanglement should not increase under LQCC; (d) additivity: E{\(j)) 0 |^/i)) = 
E{\(j))) + E{\tp)), for the case that \cj)) and \tp) are independent bipartite states; 
(e) stability with respect to a transfer of a subsystem from one party to another. 
(In any tripartite state \tjj) of a system A® B®C the bipartite entanglement of 
A® B with C should differ from that of A with B ® C by at most the entropy 
of B.) 

Horodecki et al. (1999) formulated perhaps so far the most elaborated set of 
axioms a measure of bipartite entanglement should satisfy. Their axioms seem 
well to reflect current physical understanding of the subject, (a) non-negativity, 
E{p) > 0; (b) E{p) = 0 if p is separable; (c) normalization: B(|^+)(<?+|) = 1; (d) 
monotonicity under local operations — if either of the parties sharing a pair of 
particles in the state p performs an operation leading to a state pi with probabil- 
ity Pi, then E{p) > J2iP^E{pi); (e) convexity: E{J2iPiPi) < J2iPiE{pi); (f) par- 
tial additivity: B(p®’") = nE{p); (f) continuity: if lim„^oo(^+®”|p«|^+®”) = 0, 
then also lim„_>oo ^[B(^+®") — E{pn)] = 0, where p„ is some joint state of n 
pairs. 

The underlying thesis behind the above set of axioms is that the entanglement 
of distillation is a good measure of entanglement (and therefore there is no “iff” 
in the axiom (b) — because the bound entanglement, discussed in Section 1711 is 
not distillable). 

It has been claimed that for pure states von Neumann reduced entropy is 
the only measure that satisfies relevant axioms from the above set of axioms. 
However, as discussed also by Rudolph (2000), who analyzes current state of art 
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in this area, the problem of proper axioms seems still to require a more formal 
study, and probably some other axiom(s) may need to be added. 

The case of measures of entanglement for multipartite systems is widely open. 

8.2 Entanglement of Pure States - Entropy of Entanglement 

For a pure state \4>) of a bipartite system A® B there seems to be a single good 
measure of entanglement if (| </>)), defined using relative von Neumann entropy 
and p = \4'){(j)\ as follows. 

E{\(j>)) = -Tr{pA\gPA) = -Tr{pB\gPB)- 
Hence, entanglement of a pair of quantum states is defined as the entropy of 
either member of the pair. 

This measure satisfies all axioms defined above. Moreover, any set of bipartite 
pure states the total sum of entanglement of which is an E can be reversibly 
interconverted into any other set of bipartite pure states with the same total 
entanglement E. 

8.3 Measures of Entanglement of Mixed Bipartite States 
and Their Properties 

Qualitatively, a bipartite density matrix p is entangled if some in principle phys- 
ically possible transformations on p cannot be realized if both parties perform 
only local quantum operations and classical communication. 

In spite of the large research effort, we do not have yet good quantitative 
understanding of the entanglement of mixed states. Let us list some of the basic 
questions for which we do not have yet satisfactory answer: 

1. How to determine efficiently whether a density matrix is entangled? (Dis- 
cussed in Section^. 

2. How to measure entanglement of bipartite mixed states? 

3. Is entanglement irreversibly lost if we attempt to convert it from one mixed 
state to another? 

One of the problems behind is that a density matrix can correspond to in- 
finitely many mixtures of pure states and these pure states may have very differ- 
ent entanglement properties. For example, the completely mixed state of two 
qubits corresponds to the equally probably mixture of (maximally entangled) 
Bell states, but also to a mixed state consisting of only unentangled states. 

Currently, there seems to be no “canonical-way” to quantify entanglement of 
bipartite states and it seems that no single measure of entanglement of mixed 
states is proper in all situations/applications. 

Entanglement of Formation E/, (Bennett et al. 1996a), is defined for a 
density matrix p by 

Ef{p) = min{^piE('0i) | p = 
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where p is the density matrix for the mixed state V’i)- In other words, 

entanglement of formation, Ef{p), is the least expected entanglement of any 
ensemble of pure states with p as the density matrix. 

Entanglement of formation is known exactly for many states, including all 
two-qubit states (see Hill and Wooters, 1997), having no more than two non-zero 
eigenvalues. In addition, Uhlmann (1997) has shown how to compute entangle- 
ment of formation for those states where Ef is not known to have a closed form. 

An important variant of the entanglement of formation is its asymptotic 
version 



£“((>) 



n—^oo fi 



even it seems to be very hard to compute. 

Open problem. Is entanglement of formation additive? (A positive answer 
would imply Ef = A“). 



Entanglement of Distillation (distillable entanglement), Ed (see Bennett et 
al. 1996a), is intuitively, the maximum asymptotic yield of singlets that can be 
produced from the given mixed state by local operations and classical commu- 
nication. 

Distillable entanglement can be further classified by the type of classical com- 
munication allowed (no classical communication (Edo), one-way classical com- 
munication (Edi), and {Ed^) two-way classical communication) and depends also 
on the type of the local operations allowed. 

Another way to define, informally, entanglement of distillation of a mixed 
state p is the maximum over all allowable protocols of the expected rate at which 
singlets can be obtained from a sequence of states p. For a rigorous treatment 
of the entanglement of distillation see Rains (1998). 

One can also say that if Alice and Bob share n pairs of particles in mixed 
state p, then they can faithfully teleport nEd{q) qubits. 

One of the fundamental questions concerning distillation of quantum states 
was whether distillation is in principle an irreversible process. More technically, 
whether Ed{p) < EJ^ip) for some states p. Horodeckis (1999a) have shown that 
this is indeed the case. 



Entanglement of Assistance, Ea, (DiVincenzo et al. 1998), is in a sense dual 
to the entanglement of formation 

Ea{p) = max{^p,E(V’i) | p = 

i i 



Entanglement of Relative Entropy. Er, (Vedral and Plenio, 1997) defined 
by 

Er{p) = min{Trp{lgp-lgp')), 
p gt> 

where T> denotes the set of unentangled mixed states. 
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In spite of the fact that this entropy has mainly mathematical motivation it 
has turned out as powerful tool to study entanglement. 



Relations among Different Measures of Entanglement. The following 
relations hold among the above entanglement measures (see Vedral and Plenio, 
1997, and Bennett et al. 1996a): 

Edo < Edi < Ed2 < Ej. < Ef. 

All these entanglement measures are reduced to the entropy of entanglement 
for pure states. 

Remark. The development of quantitative theory of entanglement can be 
seen as a major task of QIPC. There is already a large body of (incomplete) 
knowledge about the above measures of entanglement. However, there are also 
good reasons to be cautious about their importance because so far there are not 
too many results connecting these measures of entanglement with other major 
problems of QIPC. A search for new measures of entanglement and new ways 
how to approach this problems are clearly much needed. 

Some of the above measures of entanglement, as entanglement of formation 
and distillation, use Bell states as a standard unit of entanglement. This has two 
disadvantages as pointed out by Nielsen (2000). (a) Measures are not easy to 
generalize for multipartite entanglement because it is not clear what to take as a 
standard unit of entanglement in such multipartite systems, (b) Both measures 
essentially depend, as shown by Nielsen (2000) on the choice of the standard 
unit of entanglement what, intuitively, should not be the case for good measures 
of entanglement. They should be “dimensionless”. 

To deal with this problem Nielsen (^2000) suggested three new measures of en- 
tanglement: entanglement of creatioi^ Ec{p), entanglement of eommunicatioi^ 
Eco{p), and entanglement of eomputation. Interesting enough, the following iden- 
tities hold: Ec = Ef and = Ed- 

Nielsen (2000) defines also entanglement of computation and conjectures that 
there may be important connection between this measure of entanglement and 
key problems of computational complexity. However, his measure of entangle- 
ment depends on the choice of some “universal” two qubit gates and just counts 
the number of these gates needed, in addition to one qubit gates and classical 

^ Entanglement of creation of a state p, Ec{p) is defined as follows: Alice and Bob wish 
to create an n copies of p having no preshared entanglement, but having possibility 
to communicate qubits (at a cost) and perform classical communication (at no cost). 
The entanglement of creation is then defined to be asymptotically minimal value for 
the ratio of the number of qubits m they transmit to the number n of good (as 
n oo) copies of p they create. 

® Entanglement of communication is defined as follows: Suppose Alice and Bob share 
n copies of p. Then entanglement of communication of p, Eco{p), is defined to be 
the asymptotically maximal value for the ratio ^ of the number of qubits m Alice 
can transmit to Bob with asymptotically high fidelity, using LQCC, and n preshared 
copies of p. 
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communication, to create an entangled state. It is not yet clear which properties 
such a measure has. 

In addition, a variety of new measures of entanglement have been introduced 
recently in literature. However, it seems to be too early to judge their importance. 

Since it is currently not clear enough which approach to quantification of 
entanglement is the best, the most natural thing to do is to explore available 
approaches more in depth, and to introduce new ones, till one (some) of them 
emerges as the proper one. 

9 Multipartite Entanglement 

Understanding, characterization, classification, quantification as well as manipu- 
lation of multipartite entangled states, is a very important and difficult problem, 
perhaps the main current challenge in quantum information theory, that needs 
to be explored in order to get more insights into the possibilities of distributed 
quantum computing and networks. The problem is also of special interest outside 
of QIP, for physics in general. 

Let us start with a simple example illustrating how counterintuitive can be 
a situation in the case of multipartite entanglement. 

In an n-qubit state \4>) it seems to be natural to say that qubits i and j are 
entangled if by tracing out the remaining qubits from we get an entangled 

density matrix. However, in such a case in the GHZ state ^(|000) -I- |1H)) 
no pair of qubits is entangled even though the GHZ state would seem to be 
maximally entangled. On the other hand in the state ;^(|100) -I- |010) -I- |001)), 
that seems to be less entangled, actually any two qubits are entangled. 

Moreover, as discussed also later, for any integer n and any graph G with 
the set of vertices {1, 2, . . . , n} there is a quantum state \(j)c) over such that 
qubits i and j are entangled in \<f>g) if and only if nodes i and j are connected 
by an edge in G (Diirr, 2000). 



9.1 Creation of Multipartite States 

There are several techniques how to create multipartite entangled states. A sim- 
ple idea is due to Bose et al. (1997), as a generalization of the above procedure 
of entanglement swapping. In the generalized entanglement swapping one deals 
with generalized n-partite Bell states of the form 

^{\x) + \x)), 

where x G {0, 1}” and ir is a bitwise complement of x. Let us assume that we 
have N sets of entangled particles, each one in a generalized Bell state. Let from 
the tth set pi particles are brought together and a joint Bell measurement is 
performed on them. (Observe that all sets of Bell states of p particles form an 
orthogonal basis.) In such a case all measured particles get into a Bell state and 
the same is true for the rest of particles. 
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Figure Eh shows the case of one GHZ state and two bipartite Bell states. The 
result of the measurement is one GHZ state and one 4-partite Bell state. 

A generalization of the entanglement swapping can also be used to create 
from an m -partite Bell state and from one GHZ state an (m -I- l)-partite Bell 
state by simply performing a Bell measurement on one particle of the m-partite 
state and one particle of the GHZ state, see Figure 

Sometimes a flexibility is needed to create “on demand” a generalized Bell 
state for a group of particles. In such a case the following method can be used, see 
Figure^. Each party involved share a singlet with a center O. If a set of parties 
wants to share an multipartite state, then the center performs a generalized Bell 
measurement of its share of singlets to make the corresponding second particles 
of parties to get into a generalized Bell state. 

In full generality, for qudits and also for the continuous variable case, the 
problem of entanglement swapping has been solved by Bouda and Buzek (2000). 

Experimental creation of multipartite systems have also been attempted: 
with photon, atoms, ions and nuclear spins at NMR technology. One of them 
is generation of GHZ states. There has been also experimental creation of a 
pseudo-pure cat state ;^(|0000000) -I- 11111111)). 

9.2 Incomparable Forms of Multipartite Entanglement 

In the case of bipartite systems one of the fundamental results concerning en- 
tanglement is that all entangled states are mutually interconvertible by LQGG 
(asymptotically, with respect to the number of copies available). Nothing simi- 
lar is true for multipartite entanglement. It seems that in the case of m-partite 
systems there are at least 2™ — m — 1 kinds of asymptotically inequivalent en- 
tanglement. (This corresponds, for n = 2, . . . ,m, to all ways different 

subsets of n parties can be chosen.) Actually, the understanding of inequivalent 
ways in which subsystems of a multipartite composite system can entangle is 
one of the central problems not only of multipartite entanglement, but of all 
quantum information theory. 

The study of entanglement of multipartite systems is at the very beginning 
even if they are exactly those quantum systems from which one expects the 
most significant applications — for example for distributed computing, quantum 
networks or even for understanding of the behaviour of strongly-coupled many- 
body systems. 

Some of the basic problems to deal with are easy to formulate, but usu- 
ally difficult to solve. For example: How to detect entanglement of multipartite 
systems? What is the structure of the set of n-partite pure (mixed) entangled 
states? How many inequivalent types of entanglement we have in multipartite 
systems? Which sets of entangled states are interconvertible by LQGG (exactly 
and asymptotically)? Which types of entanglement we have in such multipar- 
tite systems (for example what are properties of bound entangled states)? What 
are the laws entanglement can be shared among n parties? What are proper 
axioms for measures of entanglement of multipartite systems? Which measures 
of entanglement of multipartite systems are useful and what are the relations 
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between them? In general, quantitative and qualitative theory of multipartite 
entanglement is needed. 

So far only very partial results are known, but they well demonstrate com- 
plexity of the task. 

For example, Diirr et al. (2000) have shown that in an n-partite system, 
with n > 4 if two pure states are chosen randomly, then they are stochastically 
incomparable in the sense that neither state can be produced from other by 
SLQCC. Moreover Bennett et al. (1999) have shown that two GHZ states are 
LQCC-incomparable with three EPR states. 

9.3 Reversible Transformations of Multipartite States 
and Entropies 

In order to study various types of multipartite entanglement the basic question 
is for which pairs of (sets of) states S and R we can transform S into R in a 
reversible way using transformations based on LQCC. 

For example one GHZ-state can be transformed into one singlet (but not 
reversibly), and two singlet into one GHZ state (but again not reversibly). 

One way to show that some reversible transformations do not exist is to 
consider how various types of entropies are changed by a transformation process. 

It is clear that entropy must be constant during a reversible process, because 
entropy cannot get increased by such a process. 

A useful tool to study reversible transformations has been developed by Lin- 
den et al. (1999). They have shown that at any reversible transformations based 
on LQGG the average relative entropy of entanglement of any pair of parties 
must be constant. This allowed them to show it is not possible to convert 2n 
GHZ-states into 3n singlets, even asymptotically. 

An interesting open problem is whether any three-partite state can be trans- 
formed in a reversible way into GHZ states and singlets. 

9.4 Sharing of Entanglement 

Another basic question concerning multipartite entanglement that has already 
been investigated is the following one: to what extend does entanglement between 
two objects restrict their entanglement with other objects? 

For example, if two qubit-particles are maximally entangled, then they can- 
not be entangled with other particle (this property of entanglement is usually 
called monogamy) . One can therefore expect, in general, that if two particles are 
entangled, then they can have only limited entanglement with other particles. 
Let us now discuss several results that have been obtained recently on sharing 
of entanglement. 

Goffmann et al. (1999) considered the following problem: given a pure state 
of three qubits, say A,B, C, how is tangle between A and B, notation tab, 
related to tangle between A and C, notation tac- They showed that 



tab + tac < ddetpA - 
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what can be interpreted that tangle between A and _B, plus tangle between A 
and C, cannot be greater than tangle between A and the pair 

Bounds on the achievable nearest neighbours entanglement have been ob- 
tained also for the case qubits form a bi-infinite translationally invariant linear 
chain or a finite ring. (Translational equivalence in these cases means that global 
states to consider are invariant under all shifts of qubits. As a consequence of 
the translational equivalence, the degree of entanglement between neighbours 
has to be constant through the chain or ring.) Wooters (2000) has shown that 
in the case of bi-infinite chains it is possible to achieve an entanglement of for- 
mation equal to 0.285 ebits between neighbours and this is the best possible 
under certain assumptions. Actually, his main result is formulated in terms of 
concurrence as a measure of entanglement. For the case of rings, O’Connor and 
Wooters (2000) derived, under certain assumptions, formulas for the maximum 
possible nearest-neighbour entanglement measured by concurrency. 

Sharing of entanglement between n qubits in the case that all pairs of qubits 
are correlated in the same way, and therefore one can say that entangled pairs 
of qubits form a complete graph, was investigated by Koashi et al. (2000) and 
Diirr et al. (2000). They showed that under the above assumptions the maximal 

entanglement between any pair of qubit, measured in terms of concurrence, is 
2 

n ■ 

Diirr (2000) has shown that in the case of n qubits entangled pairs of qubits 
can form in general any graph of n vertices. 



9.5 Measures of Entanglement of Multipartite Systems 

One of the key problems concerning multipartite entanglement is that it is far 
less clear how to measure it than in the bipartite case. How to define appropriate 
measures of entanglement of multipartite quantum states. 

The problem seems to be quite complex. In the case of bipartite systems 
of two qubits the most natural measures of entanglement are entanglement of 
formation and distillation. According to these measures the amount of entan- 
glement in a mixed state \p) is the number of maximally entangled states (Bell 
states) that can be distilled from p or are needed to form p. However, it is not 
yet clear whether entanglement of multiparticle states can be defined in terms 
of such a “standard currency”. It is not even clear what maximally entangled 
states are for multipartite systems. 

For example, in the case of three qubits it seems that GHZ states and Bell 
states form a set from which all other pure states can be generated reversibly, 
but there is no proof of it. (It seems that some other states may be needed.) 

One natural approach to explore this problem is to generalize those measures 
of entanglement for bipartite systems to multipartite systems that are not based 
on a choice of an unit of entanglement. From this point of view of interests is to 

® It make sense to speak of the tangle between A and BC, because, even if the state 
space of BC is four-dimensional, only two of these dimensions are necessary to 
express pure state of ABC. 
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generalize relative entropy of entanglement and measures introduced by Nielsen 
(2000) and discussed above. 

An attempt to do such generalization for the entanglement of formation is 
presented in Wang (2000) . Especially entanglement of formation of three-particle 
systems is studied in depth. Entanglement of a mixed state p of a three-partite 
system A® B ® C is defined by 

Ef{p) = ~[Ef{pAB) + S{pab) + Ef{pAc) + S{pAc) + Ef{pBc) + S{pBc)] + 

+ g[>5'(pA) + S{pb) + ^'(pc)], 

where S denotes von Neumann entropy 

A (straightforward) generalization of the entanglement of relative entropy 
has been developed by Plenio and Vedral (2000) and Wang (2000a). 

For example, they have shown that for regularized version of the rela- 
tive entropy of entanglement for three-partite system A ® B ® C, E'^{p) = 
lim„_>oo ^ it holds 



E“(p) = \{Er{pAB) + E“(psc) + E^{pAc)) + \iSipA) + S{pb) + S{pc)). 

10 Entanglement as a Catalyst 

In some cases, for example at teleportation, entanglement can help to perform 
communication or transmission of quantum states, but in doing that entangled 
states get destroyed. 

Surprisingly enough, there are also cases when “borrowed entanglement” can 
assist to achieve what without entanglement cannot be done and that “used” 
entangled states do not get destroyed and at the end of the activities entangled 
states can be “returned” . 



10.1 Entanglement Assisted Quantum State Transformations 

Jonathan and Plenio (1999) have shown that the mere presence of a (borrowed) 
entangled state can be an essential advantage when the task is to transform 
one quantum state into another with local quantum operations and classical 
communications (LQCC). 

They have shown that there are pairs of pure states {\(j)i) , \(j) 2 )) such that 
using LQCC one cannot transform \(f>i) into |</> 2 ), but with the assistance of an 
appropriate entangled state \tp) one can transfer |^i) into |^ 2 ) using LQCC in 
such a way that the state \ip) is not changed in the process (and can be “returned 
back” after the process). In such a case \ip) serves as a “catalyst” for otherwise 
impossible transformation (reaction). 




Power, Puzzles and Properties of Entanglement 



57 



10.2 Entanglement Assisted Quantum Channel Capacity 

A special type of quantum channel capacity has been introduced by Bennett 
et al. (1999a). Namely, entanglement assisted classical capacity, Ce, to 
be defined as a quantum channel capacity for transmitting classical information 
with the help of pure entangled states (an unlimited amount of them) shared 
between the sender and the receiver. 

The existence of superdense coding protocol, implies that Ce{J^) > 2C{N), 
for any noiseless quantum channel A/”, where C{Af) is the classical capacity of 
the channel Af. 

In the case of noisy quantum channels, surprisingly, CE(Af) can be larger, 
comparing with C(Af), by an arbitrarily large constant factor — even in the 
case the channel Af is so noisy that QC 2 {Af) = 0, where QC^iAf) is quantum 
capacity of the channel Af for the case that two-way classical communication is 
allowed. By using a noisy quantum channel Af in the superdense coding protocol 
one can obtain a lower bound on CE{Af). On the other side, by using a noisy 
classical channel in the teleportation protocol to simulate a quantum channel Af 
one can obtain an upper bound on CE{Af). This helped to determine exactly 
CE{Af) for d-dimensional depolarizing and erasure channels. CE{Af) has also 
been determined exactly for so-called “Bell-diagonal channels” (that commute 
with superdense coding and teleportation), so that Tp{Sd{Af)) = Af {Tp stands 
for teleportation, ....). 

11 Bound Entanglement 

One of the most surprising discoveries concerning entanglement of mixed states 
was due to Horodeckis (1998a). They showed that there is a “free”, distillable 
entanglement; and a “bound” entanglement that is not distillable. That is, there 
are entangled states from which no pure entangled states can be distilled and 
therefore such states are not useful for quantum teleportation and (it seems) for 
quantum communication. Horodeckis (1997a) showed that any entangled mixed 
state of two qubits can be distilled to get the singlet. However, this is not true 
in general for entangled states. 

(Horodeckis, 1998a) considered two qutrit mixed states 

a+ = i(|0)|l)(0|(l| + |l)|2)(l|(2| + |2)|0)(2|(0|), 
a_ = i(|l)|0)(l|(0| + |2)|l)(2|(l| + |0)|2)(0|(2|) 

that are separable and using these states they defined a one-parameter family 
of states 

O-a = + ycr+ -k 

where = :^(|0)|0) + |1)|1) + |2)|2)), such that these states are separable if 
2 < a < 3; they are bound entangled if 3 < a < 4 and they are free entangled if 
4 < a < 5 
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One of the peculiar features of bound entangled states is that entanglement 
has to be invested in order to create them by LQCC, but this invested entangle- 
ment cannot be distilled/recovered by LQCC. 

A first general method to construct states with bound entanglement (bound- 
entangled states), using the so called unextendible product bases was developed 
by DiVincenzo et al. (1999)0 Their family of bound-entangled states depends 
on 6 parameters. Another construction of a family of bound-entanglement states 
with 7 parameters, is due to BruB and Peres (1999). 

The goal of attempts to find explicit constructions of bound-entangled states 
is to help to elucidate properties of such states. For example, to deal with the 
following problem: 

Open problem. Is it true that bound-entangled states satisfy all Bell in- 
equalities and therefore are compatible with the hidden variable interpretation 
of quantum mechanics? 

A simple bound-entangled state in H 2 0 H 2 0 H 2 0 H 2 was discovered by 
Smolin (2000). It is the state (let us call it Smolin’s state), 

1 4 

^ i=l 

where <Pi, i = 1,2, 3, 4, are all Bell states. The above state is bound-entangled if 
the corresponding particles are possesd by four non-cooperating parties (parties 
in “different labs” ) — in such a case the parties cannot distill from it a Bell states. 
However, if we denote parties as A, B, C, D, then the above state is {A, B} : 
{C,D}, {A,C} : {B,D} and {A,D} : {B,C} separable. This is an example of 
so-called “unlockable-state” that can be used for two of the parties to distill a 
Bell state provided the other two parties assist them. 

An interesting, important and puzzling question remains whether one can 
make some use of bound-entangled states. In other words, whether there are 
tasks that can be performed better when assisted by bound-entangled states 
than with just classically correlated states. 

The first natural question along these lines is whether bound-entangled states 
can be of some use for quantum teleportation. Of course, they are not usable 
for perfect teleportation because one cannot distill from them maximally en- 
tangled pure states. In addition, Linden and Popescu (1998) have shown that 
with bound-entangled states teleportation cannot be achieved with better than 
classical fidelity. 

Till recently no protocol has been known at which bound-entangled states 
would performed better than classically entangled states. The first example of 
some use of bound-entangled states was demonstrated by Murao and Vedral 

An unextendible product basis (UPB) of H is such an orthonormal set S of product 

states from H that the subspace Hs generated by S' is a proper subspace of H and 

the orthogonal complement of Hs contains no product state. 



Power, Puzzles and Properties of Entanglement 



59 



(2000). They showed how bound-entanglement can be used for remote infor- 
mation concentration — the reverse process to quantum telecloning. 

In the information concentration scheme there are four parties, Alice, Bob, 
Charles and David and they share four particles in Smolin’s state. The particles 
of Alice, Bob and Charles represent their input port qubits and that of David 
the output port qubit. Bob and Charles posses perfect clones of an unknown 
qubit state \4>). At the end of the remote information concentration scheme 
David’s particle gets into the state \(t>). In the protocol Alice, Bob and Charles 
first perform Bell measurement between their ancilla or optimal clones and their 
particles of the Smolin state. They communicate results of their measurement 
to David (who gets in total 6 bits), who then performs a product (determined 
uniquely by six communicated bits) of Pauli matrices on his particle to get the 
state 

However, bound entangled states can be activated (Horodeckis, 1998b) and 
super activated (Shor et ah, 2000). 

Activation of bound entanglement, or the so called quasi- distillation, refers 
to the process in which a finite number of free entangled mixed states are dis- 
tilled with the assistance of a number of bound entangled states, but without 
such an assistance no useful entanglement can be distilled for these states. In 
a superactivation two bound entangled states are combined (tensored) to get a 
state which is not bound entangled - in other words, in this case bound entan- 
gled states are activated by bound entangled states — and therefore it can be 
distilled. Several examples of superactivation discuss Diirr and Cirac (2000). 

Bound entangled states have various interesting properties. Let us summarize 
some of them. 

— A bound entangled state remains bound entangled when subjected to any 
LQCC. 

— There is no bound entangled state of two qubits. 

— Each PPT-state is bound entangled and it is an open problem whether there 
are also bound entangled states that are NPT-statesliJ 

In spite of a variety results concerning bound entanglement the very fun- 
damental question remains open: What is the role of bound entanglement in 
Nature? It seems useless for communication, but this does not have to be the 
whole story. Another fundamental question is about nonlocality of bound entan- 
gled states? 

12 Quantum Information Processing Principles 

The thesis that information is physical and that the role of information in physics 
is analogous to that of energy in thermodynamics leads naturally to the search 

DiVincenzo et al. (1999) exhibit a canonical two-parameter family S of bipartite 
mixed states in Hilbert space Hd®Hd, which are NPT-states and there is an evidence 
that from them one cannot distill using LQCC maximally entangled states of two 
qubits. These states are canonical in the sense that any bipartite NPT state can be 
reduced by LQCC to a state in S. 
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for information processing principles and laws. For example, for principles and 
laws analogous to those in thermodynamics. It is only natural that quantum en- 
tanglement is expected to play the key role in such principles and laws. One such 
emerging principles seems to be no-increasing of entanglement principle — 
under local quantum operations and classical communications — (Horodeckis, 
1997). 

Informally, this principle says that entanglement cannot increase under local 
operations and classical communication. More formally, it says that if if is a 
“proper” measure of entanglement and an input state p is transformed by quan- 
tum operations into a mixed state Pi)}) then ^i^iPiE(pi) < E{p). 

This principle has been proven to hold for entanglement of formation (see 
Bennett et al. 1997a). A form of this principle has been proven to be equivalent 
with no-cloning theorem. 

12.1 Thermodynamics and Entanglement 

The above entanglement processing principles are connected with a belief of some 
that there are laws of governing entanglement processing that are analogous to 
those of thermodynamics. The entanglement is considered then to play the role 
of energy. This entanglement-energy analogy has been farther explored recently 
by Horodeckis (2000). They postulate that 

— Entanglement is a form of quantum information corresponding to internal 

energy; 

— Sending qubits corresponds to work. 

They have also shown that in the closed bipartite quantum communication 
systems, information is conserved and so is in deterministic distillation protocols. 

In order to consider balance of quantum information they introduce, in ad- 
dition to the concept of physical work, the concept of useful logical work 
in quantum communication, to be the amount of qubits transmitted without 
decoherence. 

13 Entanglement Based Quantnm Cryptography 

In the rest of the paper we discuss some of the areas where entanglement seems 
to have potential to yield significant practical applications. The first such area 
is quantum cryptography. 

Quantum key generation (QKG) is another area where entanglement has 
been shown to be of large importance. 

There is in principle a very easy and clearly unconditionally secure way for 
two parties, Alice and Bob, to make use of entanglement to generate a random 
key. 

Let Alice and Bob share n pairs of particles, each pair in the entangled state 

4 (| 00 ) + | 11 )). 
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4 



4 



4 



n pairs of particles in EPR state 



If both parties measure their particles in the standard basis (and it does not 
matter in which order), they receive, as the result of their measurement, the 
same random binary string of length n. This way of binary key generation is 
absolutely secure because no information is transmitted. 

The catch is that if Alice wants to generate with a party a random key, 
it is non-trivial to achieve that they share n pairs of particles in the above 
pure entangled state. This can be done, using one of the purification techniques 
discussed above, but it is a complex process and both parties need to posses a 
quantum processor in order to do that. 

The above way of generating random key is also a good way to explain why 
we say that entanglement does not allow a communication. In the case of two 
parties, Alice and Bob, we say that Alice has communicated with Bob if she did 
something what had an effect on measurements at Bob side. In the case of the 
above protocol Alice and Bob knows at the end that they have the same key, 
but none of them has a way to influence which one. 



13.1 Ekert’s Protocol and Its Generalizations 

In 1991 Ekert discovered a new type of QKG protocols, security of which is 
based on the completeness of quantum mechanics. Completeness here means that 
quantum mechanics provides maximum possible information about any quantum 
system and an eavesdropping can then be seen as “an introducing some elements 
of physical reality” . To each specific entanglement-based QKG protocol a (gen- 
eralized) Bell inequality is formed and by a statistical test of this inequality, on 
the basis of the probability statistics on rejected data, it is possible to determine 
whether eavesdropping took place. (Eve plays here the role of a hidden variable 
and disturbs the quantum nature of the correlations between the probabilities 
of occurrence of various outcomes for various measurements.) 

Example. Let a source emit spin-| particles in the state :^(|01) — |10)). 
Alice performs her measurement with respect to the angles 0°, 45°, 90° and Bob 
with respect to angles 45°, 90° and 135°. 

Denote E(t,j, 61,62) the probability that if Alice measures with respect to 
vector Ui, Bob with respect to [3j, then Alice (Bob) gets as the outcome 61 (62). 
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Let us also denote 

+ L;(t, j,0,0) - E{i,j, 1,0) - 
In such a case quantum mechanics requires that for 

= S 

it holds S = —2y/2 but if an eavesdropping introduces an element of physical 
reality, then it holds 

-2<S <2. 



13.2 Entanglement and Unconditional Security 
of Cryptographic Protocols 

The positive role entanglement can play for quantum key generation, to the ex- 
tend that we can have unconditionally secure quantum key generation, created 
a hope that we can have unconditional security also for such basic cryptographic 
primitives as (exact) quantum coin tossing and especially for bit commitment. 
However, this has turned out not to be the case. It has ben proven that uncon- 
ditionally secure quantum bit commitment is not possible. Cheating is always 
possible - due to the entanglement. This time it is just entanglement playing 
the role of the bad guy. 

13.3 Quantum Secret Sharing, Strong Quantum Secret Sharing 
and Quantum Cobweb 

Several variants of the classical secret sharing problem have been considered 
Secret Sharing. There is a simple method, due to Hillery, Buzek and Berthi- 
aume (1998), how Alice can distribute a (secret) qubit \(j)) = a|0) -I- /3|1) to Bob 
and Charles in such a way that they have to cooperate in order to get \(f>). 

The basic idea is that Alice couples a given particle P in the state |</>) with 
the GHZ state |^/>) = ^(|000) -I- |111)) of three particles Pa, Pb and Pc she 
shares with Bob and Charles and then performs a measurement on the state of 
particles P and Pa, with respect to the Bell basis Since 

1^)1^) = ^(l<^+)(«|00) + P\n)) + |<?-)(a| 00 ) - /3|11)) 

+ |tf'+)(/3|00) + a|ll)) + + o|ll))), 

the outcome of the measurement is that particles Pb and Pc get into one of the 
states 

^(a|00) + /3|11)), ^(«|00) - /3|11)), ^(/3|00) + a|ll)), ^(-/3|00) + a|ll)) 

and Alice gets two bits to tell her about which of these four cases happened. 
However, neither Bob nor Charles has information about which of these four 
states their particles are in. 
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Bob now performs a measurement of his particle with respect to the dual 
basis. He gets out of it one bit of information and Charles’s particle Pc gets into 
one of 8 possible states, which is uniquely determined by bits both Alice and 
Bob got as the results of their measurements, and which can be transformed into 
the state |</>) using one or two applications of Pauli matrices. Secret sharing was 
experimentally verified by Tittel et al. (2000). 

A more general problem is to design methods for the so called quantum 
(t, n) threshold sharing schemes. That is how one can “partition” or “share” a 
quantum “secret” (state) among n parties in such a way that, for a given t, any 
t of parties can, by cooperation, reconstruct the secret, but no combination of 
t — 1 of parties is able to do that. 

This problem has been solved in full generality by Cleve et al. (1999). They 
showed that such secret sharing protocol exists if /c > This limitation is due 
to No-cloning theorem. 

Classicaly, also more general secret sharing schemes are considered at which 
the so called access structure - sets of users that should be able to get secret 
- is specified. Monotonicity of the access structures is the only requirement for 
the existence of classical secret sharing schemes. Gottesman (1999) has extended 
the results of Cleve et al. (1999), to consider also more general access structures 
with the outcome that monotonicity and No-cloning theorem provide the only 
restrictions on the access structures. 



Secret Sharing in a Strong Sense. In the secret sharing scheme mentioned 
above there is no guarantee that shared secret remains really secret if two in- 
volved parties start to communicate classically. Moreover, as shown by Walgate 
et al (2000), there is no way to hide a bit between two parties if classical com- 
munication is possible and shares consist of parts of two orthogonal pure states. 

Let us say that a secret is shared by two parties in a strong sense if classical 
communication between parties is not sufficient for them to get the secret. 

Terhal et al. (2000) have shown that it is possible to share a single bit, using 
mixed quantum states, between two parties in such a way that using classical 
communication they can obtain only arbitrarily little information about the 
bit, but they can get the bit only by joint quantum measurement. Moreover, 
surprisingly, not too much entanglement is needed to share/hide one bit. 

The protocol suggested by Terhal et al. (2000) is surprisingly simple and the 
integer n used in the protocol is the degree of security of the protocol. The hider 
Alice chooses uniformly and randomly n Bell states with the only restriction 
that the number of singlets between chosen states is even (odd) if the bit to hide 
is 0 (1). Alice then distributes the chosen states to Bob and Charles, in such a 
way that the first (second) particle of each entangled pair goes to Bob (Charles) . 

Surprisingly, it can be shown that the state shared by Alice and Bob has 
actually much less entanglement as the above protocol seems to indicate. Indeed, 
the state hiding the bit 0 can be prepared without the use of the entanglement 
and the state hiding the bit 1 can be prepared using just one singlet. 
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Quantum Cobweb. Pati (2001) has discovered another surprising use of entan- 
glement. He has shown a way how a secret (an unknown qubit) can be diluted 
between n remotely located parties being in a special entangled state (ampli- 
tudes of which sum to zero), in such a way that these parties cannot disclose 
secret using (joint) unitary operations, no matter how they cooperate. This type 
of quantum channel where an unknown qubit once enters remains interwinded 
with multipartites is sort of “a quantum cobweb”. (However, it is not clear 
whether such a state could be recovered by EALQCC.) 
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Abstract. The goal of this paper is to give a survey on recent work 
studying several combinatorial and computational problems on finite sets 
of words. The problems are natural extensions of three well-understood 
problems on words, namely those of the commutation, the conjugacy, 
and the equivalence of morphisms on regular languages. Each of these 
problems provides challenging questions in the case of finite sets of words. 



1 Introduction 

Combinatorics on words is a relatively new area on discrete mathematics related 
and motivated by theoretical computer science. It is an enormous source of 
fascinating problems, see e.g. IKal . Some of the problems are very difficult, and 
their solutions have become jewels of discrete mathematics. The satisfiability 
problem of word equations is such an example, see IMal . IPII . and for its 
solution. 

Some other problems are not difficult to prove, but have turn out extremely 
fruitful for many applications, for example in efficient string matching algo- 
rithms, cf. jEE]- Examples of such basic results of words are, for instance, the 
characterization when two words commute and the periodicity lemma of Fine 
and Wilf, cf. (DEH- 

In this paper we consider three well-understood problems on words, and try to 
define and analyze what can be said about their natural extensions for finite sets 
of words. The problems are the commutation problem, the conjugacy problem, 
and the equivalence problem of morphisms on regular languages. Consequently, 
in the first and the second problem we ask to characterize when two elements 
commute or are conjugates, respectively, while in the third problem we ask for 
an algorithm to decide whether two morphisms h and g are equivalent on all 
words of a given regular language L, i.e., h{w) = g{w) holds for all w in L. Each 
of these problems have a simple solution. 

The above three problems can be formulated, in a natural way, for finite 
sets of words. However, the nature of the problems changes drastically: none 
of these problems remain simple, and only partial solutions are known. Even 
more interestingly several challenging variants of these problems can be stated. 
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This is the point we want to make in this paper. In very general terms we are 
looking at what happens to certain simple problems when a deterministic set-up 
is extended to a nondeterministic one, when words are replaced by finite sets of 
words. 

Most of the results of this paper have been achieved over the past few years 
in a collaboration with several researchers, see raa, EEHi.Ea,^, and 

EIESl- 

2 Preliminaries 

In this section we fix the terminology and notations used later, for more on words 
we refer to M or and on automata and formal languages to Em. 

A finite alphabet is denoted by A, and the free semigroup (resp. free monoid) it 
generates by A+ (resp. A*). Elements (resp. subsets) of A* are called words (resp. 
languages). The empty word is denoted by 1. We are mainly dealing with finite 
languages. The monoid of finite languages under the operation of the product 
is denoted by P{A*). We use lower case letter u, v, w, . . . to denote words and 
capital letter U, V, W, ... to denote languages. For unknowns assuming words 
(resp. languages) as their values we use letters x, y, z, . . . (resp. X,Y, Z, . . .). 

A morphism from A* into B* is a mapping h satisfying h{uv) = h{u)h(y) for 
all words u and v in A* . Its nondeterministic variant, a finite substitution, is a 
morphism ip : A* ^ V{B*). Let L C A* be a language and h, g : A* ^ B* two 
morphisms. We say that h and g are equivalent on L if 

h{w) = g{w) for all w in L. (1) 

Similarly, we can define the equivalence of finite substitutions ip,f} ■. A* ^ V{B*) 
on a language L, when the equality in (P) is, of course, the set equality. More 
generally, a monoid B* above can be replaced not only by the monoid V{B*), 
but also by an arbitrary monoid. 

Some of our problems can be stated in terms of equations over a monoid. Let 
S' be a finite set of unknowns and M a monoid. An equation with unknowns in 
S is a pair (a, (3) G x S'*" and its solution in M is & morphism s : S* — >■ M* 
satisfying s(a) = s(/3) in M. Accordingly, we define systems of equations and 
their solutions. We say that two systems of equations are equivalent in M if they 
have exactly the same solutions in M, and that a system is independent in M if 
it is not equivalent in M to any of its proper subsystems. 



3 Problems 

In this section we formulate our three problems. We need to recall that two 
elements a and /3 of a monoid M commute if 



a/3 = /3a 
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and are conjugates if 

37 S M : a'y = 7/9. 

The following problems are very natural: 

Problem 1 (The Commutation Problem). Characterize when two elements of a 
monoid M commute. 

Problem 2 (The Conjugacy Problem). Characterize when two elements of a mon- 
oid M are conjugates. 

Our third problem is a decision question: 

Problem 3 (The equivalence of morphisms on regular languages) . Given a regular 
language L C A* and morphisms h,g : A* ^ M. Decide whether or not h and 
g are equivalent on L. 

The choice of L here might look rather arbitrary. However, from the point 
of view of this paper there exist two good reasons to assume that L is regular. 
First, it makes Problem El easily decidable for free monoids B* . Second, and 
more importantly, it makes the problem very interesting for the monoid of finite 
languages, that is for finite substitutions. 

Of course, by choosing the language L in a more complicated way. Problem 
01 even in the case of free monoids, can be made difficult, or even open. This, 
however, is outside of the scope of this paper. 

4 The Word Case 

In this section we recall the solutions of Problems 1-3 for word monoids A* . 

Proposition 1. The following conditions are equivalent for words u and v in 
A*: 

(i) there exists a word p such that u,v G p* ; 

(ii) u and v commute, i.e., the pair (u,v) satisfies the equation xy = yx; 

(Hi) the pair (u,v) satisfies a nontrivial equation. 

A proof of this important result, by induction based on the length of uv, 
requires only a few lines. The equivalence of (i) and (ii) gives a simple combi- 
natorial characterization for the commutation of two words. The equivalence of 
(i) and (iii), in turn, states a special case of the well known defect theorem, see 
e.g. pEEl: if two words satisfy a nontrivial relation, then these words can be 
expressed as a product of a single word, i.e., as a power of a single word. It is 
interesting to note that natural attempts to extend this result leads to an open 
problem on words: it is not known whether or not any independent system of 
equations with three unknowns and three equations has only periodic solutions, 
i.e., all components are powers of some word. For two equations this is not the 
case, cf. e.g. 

Similarly, the conjugacy of words has a combinatorial characterization: 
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Proposition 2. The following conditions are equivalent for words u and v in 
A+: 

(i) there exist words p and q such that u = pq, v = qp: 

(a) there exists a word w such that uw = wv, i.e., the triple (u,w,v) is a 
solution of the equation xz = zy; 

(Hi) there exist p and q such that for some w, u = pq, v = qp and w € {pq)*p. 

By (i) the conjugacy of words means that one is obtainable from the other 
by a cyclic permutation, i.e., by moving a prefix of the first word to its end. 
Condition (ii), in turn, shows that the property “being conjugates” is express- 
ible as a set of all solutions of a word equation. Here an extra unknown, namely 
z, is needed while in the similar characterization for the commutation no such 
unknown is necessary. A systematic research on expressive power of word equa- 
tions was initiated in impi . Finally, the condition (iii) gives the full solution 
of the conjugacy equation xz = zy. 

A solution to Problem 0 is based on the following result . 

Proposition 3. Let L C A* be a regular language accepted by an n-state finite 
automaton. Then for any morphisms h, g : A* ^ B* the following conditions 
are equivalent: 

(i) h{w) = g{w) for all w G L; 

(ii) h{w) = g{w) for all w € F = {w € L \ \w\ < 2n}. 

The equivalence of (i) and (ii) is based on the pumping lemma of regular 
languages and a simple combinatorial property of words. Proposition 3 immedi- 
ately implies an affirmative answer to Problem0for morphisms of free monoids. 
Actually, it states even more, namely that any regular language L possesses a 
finite test set F which, moreover, can be found effectively, i.e., a finite subset F 
of L such that to test whether two morphisms are equivalent on L it is enough 
to test that on F. An important and fundamental property (the Ehrenfeucht 
Compactness Property) of free monoids asserts that any language possesses a 
finite test set, but, in general, it needs not be effectively findable, see ^nj, |AL|, 
and jcitKj . 

5 The Finite Language Case 

In this main section we consider the above problems for finite sets of words. As 
we shall see the problems become much more intriguing, in particular in many 
cases even very restricted variants of these problems are of great interest. We 
consider each of the problems separately. 



5.1 The Commutation 

We consider the equation 



XY = YX 



( 2 ) 
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on the monoid of languages. Our main interest is in the case when languages, or 
at least one of those, is finite, although one of our main problems is formulated 
for regular languages. We concentrate on the case when languages are subsets of 
, i.e., not containing the empty word. It follows immediately that for a fixed 
X there exists the unique maximal set commuting with X, and moreover it is a 
semigroup. It is obtained as the union of all sets commuting with X and referred 
to as the centralizer oi X, in symbols C(^). It was already in 1970 when Conway 
posed the following problem: 

Problem 4 (Conway’s Problem). Is the centralizer of a regular set also regular? 

Surprisingly, this has turn out to be very intriguing problem. Indeed, even 
the much simple problem seems to be still unanswered: 

Problem 5. Is the centralizer of a regular (or even finite) set recursive? 

It is interesting to note that a similar problem asking whether a decomposable 
regular language is decomposable via regular languages has a simple affirmative 
answer. In other words, if we know that a regular language L can be expressed 
as a product L = X ■ Y, with X,Y ^ {!}, then X and Y can be replaced by 
regular languages, in fact even by regular supersets of X and Y , respectively. 
This fact, noticed already in 10^, is a simple consequence of the finiteness of 
the syntactic monoids of regular languages. 

Some solutions of an equation m are easy to give: For a given X the sets 

like 

Y = X"^, Y = X + X^ + X^, and Y = X+ 

are clearly solutions, while the set X = A+ may or may not be a solution, cf. for 
example sets X\ = {a, aaa,b,ab,ba, aba} or X 2 = {a, ab,ba,bb}. Indeed, {a,b} 
commutes with Xi and b is not in any set commuting with X 2 , since ba ^ X 2 A'*'. 

Motivated, by the above examples, we can pose the following interesting 
problem: 

Problem 6 (BTC-property). Are the following conditions equivalent: 

(i) XY = YX; 

(ii) there exist a set V C A+ and sets of indices /, J C N such that 

X=|JW and Y=[jVP (3) 

ie/ iGJ 

Problem 0 deserves several comments. First, the implication (ii) (i) is 
trivial. Second, the problem asks whether languages have a “similar” characteri- 
zation for the commutation than the words have. Third, and most interestingly, 
a deep result of Bergman, cf. lEi, shows that the statement of the problem 
holds for polynomials over a field with noncommuting unknowns. It follows that 
Problem 0 has an affirmative solution for finite multisets of words. Amazingly 
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no combinatorial proof seems to be known for this nice result. Fourth, the ab- 
breviation comes from Bergman’s result: Bergman type of characterization. 

Finally, we note that, in general, the answer to the question of Problem El 
is “no”, a counterexample being Xi above, i.e., the set {a,aaa,ab,b,ba,aba}, 
which commutes also with a finite set X3 = X\ U {aa}, and clearly Xi and X3 
are not expressible in the form Q . Consequently, it remains a problem of finding 
families of languages for which the BTC-property hold, that is, for which the 
commutation of two sets can be characterized essentially as in the word case. 

In what follows we give an overview of results related to the above problems. 
We start with a simple result, cf. m- 

Theorem 1. The centralizer of a regular (or even recursive) language is in CO- 
RE. 

Theorem 1 is achieved by a straightforward application of the exhaustive 
search principle. It immediately implies that Problem 0 can be stated equiva- 
lently as a problem whether the centralizer is recursively enumerable. 
Concerning the partial solutions of Conway’s Problem we have: 

Theorem 2. Conway’s Problem has an affirmative answer in the following cas- 
es: 

(i) X is a prefix code, see jRal/ : 

(ii) card{X) = 2, see ICK(J\1: 

(Hi) X is a code which is elementary, synchronizing or contains a word of length 
1, see ^CKC>^ : 

(iv) card{X) = 3, see I'KPI : 

(v) X is an co-code, see jffPF . 

Some of the above results are special cases of the others. All of them are 
listed here partially to point out the chronological order, but also to recall the 
original papers, since the proofs are typically quite different. All results except 
(v) are based on the combinatorial considerations on words, and in particular 
so-called Graph Lemma, see IM, plays an important role. The proof of (iv) 
is quite involved. Finally, the proof of (v) uses the above mentioned Bergman’s 
theorem. 

As a conclusion, Conway’s Problem remains unanswered in many cases. 
Maybe the three most interesting subcases are the following: 

- card{X) < 00; 

- card{X) = 4; or 

- A is a finite code. 

Typically, the proofs of results in Theorem 2 settle the validity of the BTC- 
property for the corresponding sets: 

Theorem 3. The BTC-property holds in the following cases: 

(i) X is a prefix code, see fR.al^ : 
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(ii) card{X) = 2, see J CK(Jj : 

(in) X is a code which is elementary, synchronizing or contains a word of length 
1, see EEHI; 

(iv) card{X) = 3 and X is a code, see IKFt : 

(v) X is an uj-code, see m- 

The only place where the conditions (i)-(v) are different in Theorems 2 and 
3 is the case (iv): in Theorem 3 an additional requirement that X is a code is 
needed. 

As we already mentioned the BTC-property does not hold even for all finite 
sets. A particularly small counterexample was given in ^IED|: 

Example 1. Let X = {a,ab,ba,bb}. Then X commutes with 

Y = Xu U {bab, bbb}. 

Clearly, X and Y do not satisfy the condition of (2). Note also that C(X) = 
{a,b}+\{b}. 

From the above we can formulate the main open cases of ProblemElas follows: 

- card{X) = 3; 

- card{X) = 4 and X is a code; 

- card{X) < oo and X is a code. 

There are no difficulties (except the computational ones) to test whether 
two regular languages commute. However, if we go into a bit more complex 
languages, the situation changes drastically, as shown by the following result 
proved in [HiKSj . 

Theorem 4. Given a two-element set X = {a, (3), with a(3 ^ (3a, and a 
context-free language L, it is undecidable whether X and L commute. 

We proceed by illustrating a basic property of the centralizer of X (or in fact 
of any Y commuting with X). Clearly, we have: 

xe(x) = e(x)x ^ x"e(x) = e(x)x" for aii n > i. 

Consequently, by Konig’s Lemma, for any z G C(X) and any sequence X\, X2, 
. . . of elements of X we can find a sequence yi, j/2, • ■ • of elements of X such 
that za;ia;2 . . . = t/ij/2 ■ ■ ■ ■ Similarly, we can consider extensions to left. The 
illustration is shown in Figure 1. 

The property illustrated in Figure 1 has some nice consequences: 

Fact 1. For each X C A+ we have 

X+ C e(X) C Pref (X+) n Suf (X+). 

In order to state another fact, let us say that a set X C A+ is left (resp. 
right) complete if, for each w G A~^ , w G Suf{X+) (resp. w G Pre/(X+)). 
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Fig. 1. An illustration how each z G C(A') defines a double factorization of a bi-infinite 
word . . . y- 3 y- 2 y-iyiy 2 y 3 ■■■ = ■■■ X- 3 X- 2 X- 1 X 3 X 2 X 3 



Fact 2. If C(Ai) = A+, then X is both left and right complete. 

Several simple observations can be based on Fact 2. The centralizer of a three 
element set X is never A+, unless = A+. Indeed, the only three-element sets 
which are both left and right complete are the sets of form {a, b, x}. 

Further, by simple analysis only left and right complete four element sets are 
Xi = {a, &, cc, t/}, X2 = {0,06,60,66} and X3 = {00,06,60,66} = {0,6}^ (and 
the symmetric one to X^,). Clearly X^ = X}'" = A+, and as we already saw, 
C{X2) = 7I+ \ {6}. Consequently, the centralizer of a four element set can not 
equal to A+ either, as long as X^ ^ A+. 

We conclude our considerations on the commutation with the following re- 
mark. So far we considered only languages not containing the empty word. Con- 
sequently, the centralizer was defined as the maximal semigroup commuting with 
a set. Of course, we can define it also as the maximal monoid commuting with a 
given set (which was actually the original formulation of Conway). There is no 
obvious connections between these problems, as discussed more in irmi . For 
example, the “monoid centralizer” of our set X = {a, ah, ba, 66} in Example 1 is 
{a, 6}* which is not equal modulo the empty word to the “semigroup centralizer” 
{a, 6}^\{6}. Note also that the “monoid centralizer” of a set X C A* containing 
the empty word is always A*. 



5.2 The Conjugacy 

Now, the equation is 



XZ = ZY, (4) 

and we look for its solutions in the family of finite languages. Setting X = Y 
we are in the commutation problem, so that, due to the results of the previous 
section, there is no hope to have a full characterization of solutions of 0). We 
also note that considerations of Fig.l are valid also for the conjugacy - factors 
denoted by x’s and y’s are from the sets Y and X, respectively. We recall that 
X and Y are conjugates if they satisfy O for some Z, and if this is the case, we 



Combinatorial and Computational Problems on Finite Sets of Words 77 

say that X and Y are conjugated via Z. These cases are denoted by X ~ T and 
X Y, respectively. 

There are a number of interesting problems even in the case when only finite 
languages are considered. From the equations point of view we can ask: 

Problem 7. Given a finite Z C A+, characterize all finite sets X, T C A+ such 
that X Y . 

Problem 8. Given finite sets X,Y C A~^, characterize all finite sets Z C A~^ such 
that X ^z Y. 

As a decision question it is natural to ask: 

Problem 9. Decide whether two finite sets X,Y C A+ are conjugated, i.e., 
whether there exists a set Z such that X ^z Y . 

We do not know whether it would make any difference if Z in Problem 9 
would be required to be finite. 

In what follows, we state what we know about these problems. The results 
are from [CKg, and they concentrate to the cases where either card{Z) = 2 or 
card{X) = card{Y) = 2. 

To start with, we recall that 0) always has solutions of form 
X = PQ,Y = QP, and Z = P{QPy 

for some (finite) sets P and Q and some integer i > 0. In fact, Z can be also of 

the form Z = U {PQyP. These are the solutions of word-type of the equation 
i&I 

®- 

Example 2. Let P,Q',P C A+ and Z = U {PQyP for some I QN, and more- 

i&I 

over 

PQ' C X C PQ and QP C y C Q'P. 

Then, as is straightforward to see, P and Q are conjugated via Z, i.e., the 
equation can have also other types of solution than word-type ones. A concrete 
example is obtained by choosing Q = {l,b,bb}, Q' = {1,66}, and Z — P — 
{a, 6a|. Then QP = Q'P and 

PQ' = {a, abb, ba, babb} C (a, ab, abb, ba, bab, babb} = PQ, 

so that by choosing 

X = {a, ab, abb, ba, babb}, Y = {a, abb, ba, babb} 



we see that 






Note that in this example card(X) y card{Y), even if the set Z is almost the 
simplest possible: its cardinality is two and it is a prefix set! 
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The complications of Example 2 cannot occur if we restrict Z to be a biprefix 

set: 

Fact 3 . If Z is a biprefix set and X '^z Y , then necessarily card{X) = cardfY). 

From now on we consider only the cases where either card{Z) = 2 or card{X) 
= cardfY) = 2 . We call these cases binary. Even in these simple cases we do not 
have complete characterizations of Problems 7 and 8 . However, based on Fact 3 
and some more detailed related considerations (for more details see K IKMI 1 we 
have a complete answer to Problem 7 in the case when Z is a binary biprefix 
set. 

Theorem 5 . Let X,Y C be finite and Z C A~^ a biprefix set with card{Z) = 
2 . Then X '^z Y if and only if X and Y can be partitioned into sets 

t t 

X=[jXi andY=\jY, 

i=l i=l 

with card(Xi) = cardfYi) = 2 , for i = 1 , . . . ,t, such that 

X.r^zY, fori = l,... fi. ( 5 ) 

To complete the above characterization we have to deal with the relations in (EJ. 

Theorem 6 . Let X = {x\,X2\ and Y = {yi, 2/2} be binary sets and Z = {zi,Z2} 
a binary biprefix code. Then 

Xr^zY 

if and only if there exists words pi, qi, p2 and 52 and integers mi and m2 such 
that 



xi = pm, yi = iiPi, z = {pm)'^^pi 

X2 = P2Q2, 2/2 = P2P2, Z = (P292)’"^P2, 



and moreover, 

9 i(P 2 g 2 )'"" = {qiPi)""^q2 and 92(^191)’”^ = {q2P2)"'^Pi- 

Now, we turn to Problem 8 in the binary case, that is in the case when both X 
and Y are binary. Our next result shows that the conjugacy of X and Y reduces 
to that of words. It provides a complete characterization, which, however, is not 
easy to state in a closed form. We denote by p{w) the primitive root of w. 

Theorem 7 . Let X = {a:i,a;2} C and Y = {221,2/2} C A+ be conjugates, 
i.e., X ^Y . Then at least one of the following conditions holds true: 

(i) X1X2 = X2X1, 2/12/2 = 2/22/i> and p{xi) p{yi); 

(a) |a:i| = 1x21 and there exists a word z such that, for i = 1 or 2 ,x\ yi and 
either X2 2/3-i or ys-i ^z X2; 

(Hi) \xi\ |x2| and x\ ^ yt and X2 ~ 2 / 3 -i for i = 1 or 2 . 
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Note that the notation x y means that xz = zy, and therefore not neces- 
sarily implies that y x. As we said it is not easy to see directly from Theorem 
7 what is the characterization for the conjugacy of two element sets. However, 
it is obtainable via Proposition 2. 

We conclude this section with two remarks on Problem 9. First of all we do 
not have a solution for it. In fact, we believe that it might contain similar in- 
triguing features as Conway’s Problem. However, we do know that it is decidable 
in the case of biprefix codes. 

Theorem 8. It is decidable whether two finite biprefix codes are conjugates. 

5.3 The Equivalence of Finite Substitutions on Regular Languages 

In this final section we concentrate to the following decision problem: 

Problem 10 (The equivalence of finite substitutions on regular language). Given 
a regular language L C A* and two finite substitutions (f,ip : A* ^ V{B*), 
decide whether or not gi and if are equivalent on L, i.e., 

ip{w) = ifiw) for all w G LI (6) 

It is not difficult to see that this problem can be translated into an equiva- 
lence problem of finite transducers, the translation being based on the fact that 
the set of accepting computations of a finite automaton is regular as well, see 
fTITTq . More precisely. Problem 10 is equivalent to the decision problem asking 
to decide whether two input deterministic gsm’s are equivalent. Here the in- 
put determinism means that the automaton is deterministic with respect to the 
input tape. 

This translation emphasizes the importance of Problem 10, and in particu- 
lar its surprising solution. In see also [HHj . Problem 10 was shown to be 
undecidable even for a fixed simple language L. 

Theorem 9. It is undecidable whether two finite substitutions g> and if are 
equivalent on the language L =^{a,b}* 

The structure of L in Theorem 9 motivates to state a simpler form of Problem 
10 : 

Problem 11. Is it decidable whether two finite substitutions ip, if : — >■ 

V{B*) are equivalent on the language L' =^a* 

The question in Problem 11 is very simple - but apparently amazingly dif- 
ficult. In order to support this view we recall the following. First, as shown in 
there does not exist any finite test set for L', i.e., a finite subset F of L' 
such that to test the equivalence of any two g> and if on L' it would be enough to 
do it on F. Second, some variants of Problem 11 are decidable, and some others 
are undecidable. Decidable variants are obtained by taking some special types 
of finite substitutions, see [KLlj . for example, if the substitution ip is so-called 
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prefix substitution, i.e., for each letter c the set Lp{c) is a prefix set. Actually, in 
this setting the language can be any regular language, and moreover, instead of 
the equality we can ask the inclusion: 

<p{w) C ip{w) for all w G L. (7) 

On the other hand, if we ask the inclusion, i.e., the validity of ((3), for arbitrary 
finite substitutions the problem becomes undecidable even for the language L' : 

Theorem 10. The inclusion problem for finite substitutions ip and ij) on the 
language L' =^A* ^ is undecidable. 

This surprising result was first proved in and later reproved in pmr] 
in a slightly stronger form, that is in the form where <p{c) is of cardinality 2 
for each letter c. This extension yields also the following very simply formulated 
undecidability result: 

Theorem 11. Let X = {a,/3} C A+ be binary, and let Y, Z Q A+ be finite. It 
is undecidable whether or not the inclusion 

A" C 

holds for all n > 1. 

Theorems 10 and 11, and in particular their proofs, reveal surprising facts 
about the computational power of finite sets. For example, the question of Theo- 
rem 11 can be interpreted as a two player game. The first player have two choices 
(a or P) and he chooses an arbitrary sequence of those, i.e., an arbitrary path in 
the complete binary tree. The second player has also a simple strategy: he first 
chooses an element from a finite set Y and then consecutively elements from 
the other finite set Z. The question is whether the second player can always re- 
spond with a sequence being equally long as that one chosen by the first player. 
Theorem 11 asserts that, in general, there is no algorithm to decide whether the 
second player always succeeds (to create the same word). 

Despite of Theorems 10 and 11 our Problem 11 remains still unanswered. 
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Abstract. This is a survey of universality results in the area of Mem- 
brane Computing (P systems), at the level of December 200cQ. We con- 
sider both P systems with symbol-objects and with string-objects; in 
the latter case, we consider systems based on rewriting, splicing, as well 
as rewriting together with other operations (replication, crossingover) , 
with sets or with multisets of strings. Besides recalling characterizations 
of recursively enumerable languages and of recursively enumerable sets 
of vectors of natural numbers, we also briefly discuss the techniques used 
in the proofs of such results. Several open problems are also formulated. 



1 Introduction; The Basic Idea 

The P systems (initially, in I2HI, they were called super-cell systems) were intro- 
duced as a possible answer to the question whether or not the frequent state- 
ments (see, e.g., ini, m) that the processes which take place in a living cell are 
“computations”, that “the alive cells are computers”, are just metaphors, or a 
formal computing device can be abstracted from the cell functioning. As we will 
see below, the answer turned out to be affirmative. 

Three are the fundamental features of alive cells which are basic to P systems: 

(1) the complex compartmentation by means of a membrane structure, where 

(2) multisets of chemical compounds evolve according to prescribed (3) rules. 
A membrane structure is a hierarchical arrangement of membranes, all of 

them placed in a main membrane, called the skin membrane. This one delimits 
the system from its environment. The membranes should be understood as three- 
dimensional vesicles, but a suggestive pictorial representation is by means of 
planar Euler- Venn diagrams (see Figure 1). Each membrane precisely identifies 
a region, the space between it and all the directly inner membranes, if any exists. 
A membrane without any membrane inside is said to be elementary. 

^ An up-to-date bibliography of the area can be found at the web address 
http : //bioinf ormatics .bio . disco .unimib. it /psy stems 

M. Margenstern and Y. Rogozhin (Eds.): MCU 2001, LNCS 2055, pp. S2-|1 01 ^ 2001. 

(c) Springer- Verlag Berlin Heidelberg 2001 
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In the regions of a membrane structure we place sets or multisets of objects. 
A multiset is a usual set with multiplicities associated with its elements, in the 
form of natural numbers; the meaning is that each object can appear in a number 
of identical copies in a given region. For the beginning, the objects are supposed 
to be symbols from a given alphabet (we will work with finitely many types of 
objects, that is, with multisets over a finite support-set), but later we will also 
consider string-objects. 



membrane skin elementary membrane 




Fig. 1. A membrane structure and its associated tree 



The objects evolve by means of given rules, which are associated with the 
regions (the intuition is that each region has specific chemical reaction conditions, 
hence the rules from a region cannot necessarily act also elsewhere) . These rules 
specify both object transformation and object transfer from a region to another 
one. The passing of an object through a membrane is called communication. 

Here is a typical rule: 

cabb — >■ caadoutdins, 

with the following meaning: one copy of the catalyst c (note that it is reproduced 
after the “reaction”) together with one copy of object a and two copies of object 
b react together and produce one copy of c, two of a, and two copies of object 
d] one of these latter objects is sent out of the region where the rule is applied, 
while the second copy is sent to the adjacently inner membrane with the label 3, 
if such a membrane exists; the objects c, a, a remain in the same membrane (it 
is supposed that they have associated the communication command here, but 
we do not explicitly write this indication); if there is no membrane with label 
3 directly inside the membrane where the rule is to be applied, then the rule 
cannot be applied. By a command out, an object can be also sent out of the skin 
membrane, hence it leaves the system and never comes back. 
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Therefore, the rules perform a multiset processing; in the previous case, the 
multiset represented by cabb is subtracted from the multiset of objects in a given 
region, objects caa are added to the multiset in that region, while copies of d 
are added to the multisets in the upper and lower regions. 

The rules are used in a nondeterministic maximally parallel manner, the 
objects to evolve and the rules to be applied to them are chosen in a non- 
deterministic manner, but after assigning objects to rules no further rule should 
be applicable to the remaining objects. Sometimes, a priority relation among 
rules is considered, hence the rules to be used and the objects to be processed 
are selected in such a way that only rules which have a maximal priority among 
the applicable rules are used. 

Other features can be considered, such as the possibility to control the mem- 
brane thickness/permeability, but we will introduce them later. 

The membrane structure together with the multisets of objects and the sets 
of evolution rules present in its regions constitute a P system. The membrane 
structure and the objects define a configuration of a given P system. By using 
the rules as suggested above, we can define transitions among configurations. A 
sequence of transitions is called a computation. We accept as successful compu- 
tations only the halting ones, those which reach a configuration where no further 
rule can be applied. 

With a successful computation we can associate a result, for instance, by 
counting the multiplicity of objects which have left the system during the com- 
putation. More precisely, we can use a P system for solving three types of tasks: 
as a generative device (start from an initial configuration and collect all vectors 
of natural numbers describing the multiplicities of objects which have left the 
system during all successful computations), as a computing device (start with 
some input placed in an initial configuration and read the output at the end of a 
successful computation, by considering the objects which have left the system), 
and as a decidability device (introduce a problem in an initial configuration and 
wait for the answer in a specified number of steps). Here we deal only with the 
first case. Many classes of P systems turn out to be computationally universal, 
able to generate exactly what Turing machines can recursively enumerate. 



2 A More Formal Definition of a P System 

A membrane structure can be mathematically represented by a tree, in the 
natural way, or by a string of matching parentheses. The tree of the structure 
in Figure 1 is given in the same figure, while the parenthetic representation of 
that structure is the following: 

Ii [212 Uls [4 [sis [5 Isis [glgle [ 7 ] 7 ] 4 li’ 

The tree representation makes possible considering various parameters, such as 
the depth of the membrane structure, and also suggests considering membrane 
structures of particular types (described by linear trees, star trees, etc). 
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A multiset over an alphabet V = {ai, . . . , a„} is a mapping fi from V to N, 
the set of natural numbers, and it can be represented by any string w G V* such 
that 'Pviw) = (Ai(ai)) • ■ • ) where 'f'y is the Parikh mapping associated 
with V . Operations with multisets are defined in the natural manner. 

With these simple prerequisites, we can define a P system (of degree m > 1) 
as a construct 



n — (P, P, C ^ Wi ^ i^Rl 5 Pi ) , • ■ • 7 (^Rm 5 Pmf) , 



where: 

(i) V is an alphabet; its elements are called objects] 

(ii) T CV (the output alphabet); 

(hi) CQV,Cr\T=^ (catalysts)] 

(iv) ^ is a membrane structure consisting of m membranes, with the membranes 
and the regions labeled in a one-to-one manner with elements of a given 
set H] in this section we use the labels 1, 2, . . . , m; 

(v) Wi,l < i < m, are strings representing multisets over V associated with 
the regions 1, 2, . . . , m of p; 

(vi) Ri,l < i < m, are finite sets of evolution rules over V associated with the 
regions 1, 2, . . . , m of p; pi is a partial order relation over i?j, 1 < f < to, 
specifying a priority relation among rules of Ri. 

An evolution rule is a pair (u,v), which we will usually write in the form 
u — >■ u, where u is a string over V and v = v' or v = v'S, where v' is a string 
over {o/iere, Ooiii , Oirij \ o,€V,l<j< TO.}, and (5 is a special symbol not 
in V. The length of u is called the radius of the rule u ^ v. 

When presenting the evolution rules, the indication “here” is in general omit- 
ted. 

If n contains rules of radius greater than one, then we say that 7T is a system 
with cooperation. Otherwise, it is a non-cooperative system. A particular class of 
cooperative systems is that of catalytic systems: the only rules of a radius greater 
than one are of the form ca — >■ cv, where c G C,a G V — C, and v contains no 
catalyst; moreover, no other evolution rules contain catalysts (there is no rule of 
the form c — >■ v or a — >■ vicv 2 , for c G C). 

The (to -I- l)-tuple (p.,wi, . . . ,iVm) constitutes the initial configuration of 
77. In general, any sequence (p ! with p' a membrane structure 
obtained by removing from p all membranes different from i\, . . . ,ik (of course, 
the skin membrane is not removed), with zc' strings over V, 1 < j < k, and 
{ii, . . . , ik} C {1,2,..., to}, is called a configuration of 77. 

It should be noted the important detail that the membranes preserve the 
initial labeling in all subsequent configurations; in this way, the correspondence 
between membranes, multisets of objects, and sets of evolution rules is well 
specified by the subscripts of these elements. 

For two configurations Ci = (p', . . . ,w(J, C 2 = (p" , w" , . . . , w" ), of 77 

we write Ci C 2 , and we say that we have a transition from C\ to C 2 , if we 
can pass from Ci to C 2 by using the evolution rules appearing in Ri .^ , • . • , Ri^ 
in the following manner. 
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Consider a rule u — >■ x; in a set Ri^. We look to the region of associated 
with the membrane it- If the objects mentioned by u, with the multiplicities 
at least as large as specified by u, appear in then these objects can evolve 
according to the rule u ^ v. The rule can be used only if no rule of a higher 
priority exists in Ri^ and can be applied at the same time with u ^ v. More 
precisely, we start to examine the rules in the decreasing order of their priority 
and assign objects to them. A rule can be used only when there are copies of the 
objects whose evolution it describes and which were not “consumed” by rules of 
a higher priority and, moreover, there is no rule of a higher priority, irrespective 
which objects it involves, which is applicable at the same step. Therefore, all 
objects to which a rule can be applied must be the subject of a rule application. 
All objects in u are “consumed” by using the rule u — >■ u. 

The result of using the rule is determined by v. If an object appears in v in 
the form atere, then it will remain in the same region it- If an object appears 
in V in the form aouu then a will exit membrane it and will become an element 
of the region which surrounds membrane it- In this way, it is possible that an 
object leaves the system: if it goes outside the skin of the system, then it never 
comes back. If an object appears in the form aj„^, then a will be added to the 
multiset from membrane q, providing that the rule xt — >■ v was used in the region 
adjacent to membrane q. If appears in v and membrane q is not one of the 
membranes delimiting “from below” the region it, then the application of the 
rule is not allowed. 

If the symbol 5 appears in v, then membrane it is removed (we say dissolved) 
and at the same time the set of rules Ri^ (and its associated priority relation) 
is removed. The multiset from membrane it is added (in the sense of multisets 
union) to the multiset associated with the region which was directly external to 
membrane it- We do not allow the dissolving of the skin membrane, because this 
means that the whole “cell” is lost, we do no longer have a correct configuration 
of the system. 

All these operations are performed in parallel, for all possible applicable rules 
xt — >■ x;, for all occurrences of multisets u in the regions associated with the rules, 
for all regions at the same time. No contradiction appears because of multiple 
membrane dissolving, or because simultaneous appearance of symbols of the 
form Oout Emd 5. If at the same step we have atn^ outside a membrane i and 
(5 inside this membrane, then, because of the simultaneity of performing these 
operations, again no contradiction appears: we assume that a is introduced in 
membrane i at the same time when it is dissolved, thus a will remain in the 
region surrounding membrane x; that is, from the point of view of o, the effect 
of aim the region outside membrane i and 6 in membrane i is ahere- 

A sequence of transitions between configurations of a given P system II is 
called a computation with respect to II. A computation is successful if and only 
if it halts, that is, there is no rule applicable to the objects present in the last 
configuration. The result of a successful computation is where w describes 

the multiset of objects from T which have been sent out of the system during the 
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computation. The set of such vectors is denoted by Ps{II) (from “Parikh 

set”) and we say that it is generated by U. 

3 Universality Results for Systems with Symbol-Objects 

In this section we recall some results about the generative power of some variants 
of P systems working with symbol-objects. In many cases, characterizations of 
recursively enumerable sets of vectors of natural numbers (their family is denoted 
by PsRE) are obtained. 



3.1 Further Features Used in P Systems 

Before giving these results, we will specify some further ingredients which can 
be used in a P system. They are in general introduced with the aim of obtaining 
more “realistic” systems. For instance, instead of the powerful command inj, 
which indicates the target of the destination membrane, we can consider weaker 
communication commands. The weakest one is to add no label to in: if an object 
ain is introduced in some region of a system, then a will go to any of the adjacent 
lower membranes, nondeterministically chosen; if no inner membrane exists, then 
a rule which introduces Qi„ cannot be used. 

An intermediate possibility is to associate both with objects and membranes 
electrical charges, indicated by -I-, — , 0 (positive, negative, neutral). The charges 
of membranes are given in the initial configuration and are not changed during 
computations, the charge of objects are given by the evolution rules, in the form 
a — >■ b^d~ . A charged object will immediately go into one of the directly lower 
membranes of the opposite polarization, nondeterministically chosen, the neutral 
objects remain in the same region or will exit it, according to the commands here, 
out associated with them. 



St St 




Moreover, besides the action S we can also consider an opposite action, de- 
noted by r, in order to control the membrane thickness (hence permeability). 
This is done as follows. Initially, all membranes are considered of thickness 1. If 
a rule in a membrane of thickness 1 introduces the symbol t, then the membrane 
becomes of thickness 2. A membrane of thickness 2 does not become thicker by 
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using further rules which introduce the symbol r, but no object can enter or exit 
it. If a rule which introduces the symbol S is used in a membrane of thickness 1, 
then the membrane is dissolved; if the membrane had thickness 2, then it returns 
to thickness 1. If at the same step one uses rules which introduce both S and r 
in the same membrane, then the membrane does not change its thickness. These 
actions of the symbols S,t are illustrated in Figure 2. 

No object can be communicated through a membrane of thickness two, hence 
rules which introduce commands out, in, requesting such communications, can- 
not be used. However, the communication has priority over changing the thick- 
ness: if at the same step an object should be communicated and a rule introduces 
the action r, then the object is communicated and “afterthat” the membrane 
changes the thickness. 

Also a variant of catalysts can be considered, with a “short term memory” . 
Such catalysts (we call them bi-stable) have two states each, c and c, and they 
can enter rules of the forms ca — >■ cv, ca — >■ cv (always changing from c to c and 
back) . 

3.2 The Power of the Previous Systems 

Consider now some notations. The family of sets of vectors of natural numbers 
Ps{n) generated by P systems with priority, catalysts, and the actions S,t, and 
of degree at most m > 1, using target indications of the form here, out, in, is 
denoted by N Pm{Pri,Cat,i/ o,5,t)-, when one of the features a S {Pri, Cat, 
<5, r} is not present, we replace it with na. We also write 2Cat instead of Cat 
when using bi-stable catalysts instead of usual catalysts. 

Proofs of the following results can be found in m, ini> M- 

Theorem 1. PsRE = NP 2 {Pri,Cat,i/o,nS,nT) = NP 4 {nPri,Cat,i/o,S,T) 
= NPi{nPri,2Cat, i/o,nS,nT). 

It is an open problem whether or not systems of the type (nPri, Cat, ij o,b, r) 
can characterize PsRE when using less than four membranes. 

3.3 Conditional Use of Rules 

Starting from the observation that in the biochemistry of the cell certain reac- 
tions are enhanced/supressed by certain chemical compounds, such as enzymes, 
catalysts, hormons, in one considers P systems with the rules applicability 
controlled by the contents of each region by means of certain promoters and in- 
hibitors', these promoters/inhibitors are given as multisets of objects associated 
with given sets of rules. A rule from such a set (as usual, placed into a region) can 
be used in its region only if all the promoting objects are present, respectively, 
only if not all the inhibiting objects are present in that region. 

Specifically, a P system (of degree m > 1), with promoters is a construct 

^ (U ) • ■ • 7 7 (.^ 1 , 17 ^ 1 , 1)7 • ■ • 7 7 Pl,ki) ■> ■ ■ • 7 (.f^m,l 7 Pm,l) 7 

...,{R 

m,km I Pm,km)^ 1 
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where all components are as usual in a P system (note that we use neither a 
terminal subset of V, nor catalysts) and Ri^i, . . . , Ri^kaki > 1, are finite sets 
of rules present in region i of the system, while pij are strings over V, called 
promoters, 1 < i < rn,l < j < ki. The rules are of the form a — >■ u, for a G V, 
V G {btar I b G V,tar G {here, out, in}}* . 

The rules are used as usual in P systems, in the maximally parallel manner, 
but the rules from a set Rij can be used only if the multiset represented by p^ j 
is present in region i, for all 1 < i < m, 1 < _) < 

An identical definition holds for systems with forbidding conditions, where 
the rules from Rij are used only if the associated multiset pij is not included 
in the multiset of objects present in the region. 

The maximum ol ki,l < i < m, is called the promoter diversity (resp., 
inhibitor diversity) of the system. The family of all sets Ps{II), computed as 
above by systems U of degree at most m > I and with the promoter diversity 
not exceeding fc > 1, is denoted by PsPm,{i/o,promk). When using inhibitors, 
we replace proruk by inhibk- We stress the fact that we do not use catalysts, 
priorities, or actions S, t, while the communication commands are of the form 
here, out, in. The following results are proved in they show a clear trade- 
off between the number of membranes and the promoter diversity of the used 
systems. 

Theorem 2. PsRE = PsP^{i/ o,prom 2 ) = P sP^{i / o,prom^) = PsP^{i/o, 
prom^) = PsPi{i/o,prom'j) = PsP^{i/o,inhibo). 

It is an open problem whether or not these results can be improved (for in- 
stance, using at the same time less membranes and a smaller promoter/inhibitor 
diversity). 

4 P Systems with String-Objects 

As we have mentioned in the Introduction, is also possible (this was considered 
already in m) to work with objects described by strings. The evolution rules 
should then be string processing rules, such as rewriting and splicing rules. As 
a result of a computation we can either consider the language of all strings 
computed by a system, or the number of strings produced by a system and sent 
out during a halting computation. In the first case one works with usual sets 
of strings (languages), not with multisets (each string is supposed to appear in 
exactly one copy), while in the latter case we have to work with multisets. We 
start by considering the set case. 

4.1 Rewriting P Systems 

We consider here the case of using string-objects processed by rewriting. Always 
we use only context-free rules, having associated target indications. Thus, the 
rules of our systems are of the form (A — >■ v;tar), where A — >■ u is a context- 
free rule over a given alphabet and tar G {here, out, in}U{inj | 1 < j < m}. 
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with the obvious meaning: the string produced by using this rule will go to the 
membrane indicated by tar (j is the label of a membrane). As above, we can 
also use priority relations among rules as well as the actions 6 , r, and then the 
rules are written in the form (X — >■ xa; tar), with a S {<5, r}. 

Formally, a rewriting P system is a construct 

n — (V, P, L\, . . . , Prni (^Pl ; Pi ) ; • ■ ■ ; {Pm : Pm)) : 

where V is an alphabet, T Q V (the terminal alphabet), /r is a membrane 
structure with m membranes labeled with 1, 2, . . . , m, L\, . . . , Lm are finite lan- 
guages over V (initial strings placed in the regions of p,), Pi,. , Pm are finite 
sets of context-free evolution rules, p\, . . . , pm are partial order relations over 

Pi 1 • • ■ 1 Pm • 

The language generated by II is denoted by L{II) and it is defined as follows: 
we start from the initial configuration of the system and proceed iteratively, by 
transition steps performed by using the rules in parallel, to all strings which can 
be rewritten, obeying the priority relations; at each step, each string which can be 
rewritten must be rewritten, but this is done in a sequential manner, that is, only 
one rule is applied to each string; the actions 8 , r have the usual meaning; when 
the computation halts, we collect the terminal strings sent out of the system 
during the computation. We stress the fact that each string is processed by one 
rule only, the parallelism refers here to processing simultaneously all available 
strings by all applicable rules. 

We denote by PPm{Pri, i/o, 8 , r) the family of languages generated by rewrit- 
ing P systems of degree at most m > 1, using priorities, target indications of the 
form here, out, in, and actions 8 , r; as usual, we use nPri, n 8 , nr when appro- 
priate. 

The following result is proved in m for the case of three membranes; the 
improvement to two membranes was given independently in H2I and |2S|- 

Theorem 3. RE — RP 2 {Pri,i/o,n 8 ,nT). 

The powerful feature of using a priority relation can be avoided at the price 
of using membranes with a variable thickness. This was proved first in m. 

without a bound on the number of membranes, then the result has been 
improved in m-- 

Theorem 4. RE = RP/i{nPri,i/o, 8 ,T). 

It is not known whether or not this result is optimal. 

4.2 Splicing P Systems 

The strings in a P system can also be processed by using the splicing operation 
introduced in na as a formal model of the DNA recombination under the influ- 
ence of restriction enzymes and ligases (see a comprehensive information about 
splicing in m)- 
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Consider an alphabet V and two symbols $ not in U. A splicing rule over 
U is a string r = where Ui, U2, U3, M4 G V* {V* is the set of all 

strings over V). For such a rule r and for x,y,w,z G V* we define 

(X,y) \~r (W,z) iff X = XiMiM2a;2, y = 2/1W3M42/2, W = XiUiUiy 2 , Z = yiU 3 U 2 X 2 , 
for some xi,X2,yi,y2 G V* . 

(One cuts the strings x,y in between U\,U2 and ^3,^4, respectively, and one 
recombines the fragments obtained in this way.) 

A splicing P system (of degree m > 1) is a construct 

n — (U, F, /r, A4 , . . . , Ljyi , , . . . , Rm ) , 

where V is an alphabet, T Q V (the output alphabet), ^ is a membrane struc- 
ture consisting of m membranes (labeled with Li,l < i < m, are 

languages over V associated with the regions 1, 2 , . . . , m of Ri, 1 < i < m, are 
finite sets of evolution rules associated with the regions 1, 2, . . . , m of /i, given in 
the form (r; tari,tar2), where r = ui^U2%U3^U4 is a usual splicing rule over V 
and tari,tav2 G {here, out, in}. 

Note that, as usual in splicing systems, when a string is present in a region 
of our system, it is assumed to appear in arbitrarily many copies. 

A transition in II is defined by applying the splicing rules from each region 
of pL, in parallel, to all possible strings from the corresponding regions, and 
following the target indications associated with the rules. More specifically, if 
x,y are strings in region i and (r = ui#U2$W3#rt4; tari, tar2) G Ri such that 
we can have (x,y) hr (w,z), then w and z will go to the regions indicated by 
tari,tar2, respectively. Note that after splicing, the strings x, y are still available 
in region i, because we have supposed that they appear in arbitrarily many copies 
(an arbitrarily large number of them were spliced, arbitrarily many remain), but 
if a string w, z, resulting from a splicing, is sent out of region i, then no copy of 
it remains here. 

The result of a computation consists of all strings over T which are sent out of 
the system at any time during the computation. We denote by L{U) the language 
of all strings of this type. Note that in this subsection we do not consider halting 
computations, but we leave the process to continue forever and we just observe 
it from outside and collect the terminal strings leaving the system. 

We denote by SPm{i/o) the family of languages L{II) generated by splicing 
P systems as above, of degree at most m > 1. 

In |3S| it was proved that 5*^3 (z/o) = RE; the result has been improved in 
m as follows. 

Theorem 5. RE = SP2{i/o). 

4.3 P Systems with Rewriting and Replication 

Recently, in UHl a variant of rewriting P systems was considered, where rules of 
the form X — >■ {ui,tari)\ \ . . . ||(rt„,tar„) are used. When applying this rule to 
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a string, n strings are obtained, by replacing one occurrence of the symbol X 
by Ui, . . . ,Un and replicating the remaining part of the string; the n strings are 
sent to the membranes indicated by the targets tar\, . . . , tar„, respectively. The 
definition of a replication rewriting system and of its generated language is as 
usual in P systems area. 

The family of all languages L{U), computed as above by systems II of degree 
at most m > 1 is denoted by RRPm{i/o). 

The following result is from m, where one solves the problem formulated 
as open in UBI whether or not the hierarchy on the number of membranes gives 
an infinite hierarchy (as expected, the answer is negative). 

Theorem 6. RE = RRPQ{i/o). 

4.4 Rewriting P Systems with Conditional Communication 

In this subsection, we recall the universality results from j^j, where one con- 
siders a variant of rewriting P systems where the communication of strings is 
not controlled by the evolution rules, but it depends on the contents of the 
strings themselves. This is achieved by considering certain types of permitting 
and forbidding conditions, based on the symbols or the substrings (arbitrary, or 
prefixes/suffixes) which appear in a given string, or on the shape of the string. 

First, let us mention that the set of symbols appearing in a string x € V* 
is denoted by alph(x) and the set of substrings of x is denoted by Sub{x). A 
regular expression is said to be elementary if it has the star height at most one 
and uses the union at most for symbols in the alphabet. 

A rewriting P system (of degree m > 1) with conditional communication is 
a construct 



II— (V, T, /r, Li, . . . , Lrm i?i, Pi, Fi, . . . , R^, P^, Fm), 

where the components P, P, /i. Pi, . . . , Pm are as usual in rewriting P systems. 
Pi , . . . , Rm are finite sets of context-free rules over V present in region i of the 
system, Pi are permitting conditions and Fi are forbidding conditions associated 
with region i, 1 < i < m. 

The conditions can be of the following forms: 

1. empty: no restriction is imposed on strings, they exit the current membrane 
or enter any of the directly inner membrane freely. 

2. symbols checking: each Pi is a set of pairs (a, a), a G {in, out}, for a GV, 
and each Fi is a set of pairs (6, nota) , a G {in, out}, for b G V; a string w can 
go to a lower membrane only if there is a pair (a, in) G Pi with a G alph(w), 
and for each (b, notin) G Fi we have b ^ alph{w); similarly, for sending the 
string w out of membrane i it is necessary to have a G alph(w) for at least 
one pair (a, out) G Pi and b ^ alph{w) for all {b, notout) G Fi. 

3. substrings checking: each Pi is a set of pairs {u,a),a G {in, out}, for u G 
P+, and each Fi is a set of pairs {v, nota), a G {in, out}, for v G P+; a 
string w can go to a lower membrane only if there is a pair (u, in) G Pi with 
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u G Sub{w), and for each {v, notin) G Fi we have v ^ Sub{w); similarly, for 
sending the string w out of membrane i it is necessary to have u G Sub(w) 
for at least one pair (u,out) G Pi and v ^ Sub{w) for all {v, notout) G Fi. 

4. prefix/suffix checking', exactly as in the case of substrings checking, with the 
checked string being a prefix or a suffix of the string to be communicated. 

5. shape checking: each Pi is a set of pairs (e, 0 !),a G {in, out}, where e is 
an elementary regular expression over V, and each Fi is a set of pairs 
{f,nota),a G {in, out}, where / is an elementary regular expression over 
V ; a string w can go to a lower membrane only if there is a pair (e, in) G Pi 
with w G L(e), and for each pair (f, notin) G Fi we have w ^ L{f)-, 
similarly, for sending the string w out of membrane i it is necessary to 
have w G L{e) for at least one pair (e, out) G Pi and w ^ L{f) for all 
{f, notout) G Fi. 

We say that we have conditions of the types empty, symb, sub^, prefk, suffk, 
patt, respectively, where k is the length of the longest string in all Pi, Fi. 

The transitions in a system as above are defined in the usual way. In each 
region, each string which can be rewritten is rewritten by a rule from that region. 
Each string obtained in this way is checked against the conditions Pi,Fi from 
that region. If it fulfills the requested conditions, then it will be immediately sent 
out of the membrane or to an inner membrane, if any exists; if it fulfills both 
in and out conditions, then it is sent either out of the membrane or to a lower 
membrane, nondeterministically choosing the direction. If a string does not fulfill 
any condition, or it fulfills only in conditions and there is no inner membrane, 
then the string remains in the same region. A string which is rewritten and a 
string which is sent to another membrane is “consumed” , we do not have a copy 
of it at the next step in the same membrane. If a string cannot be rewritten, 
then it is directly checked against the communication conditions, and, as above, 
it leaves the membrane (or remains inside forever) depending on the result of 
this checking. 

That is, the rewriting has priority over communication: we first try to rewrite 
a string and only after that we try to communicate the result of the rewriting 
or the string itself if no rewriting is possible on it. 

The family of all languages L{F[), computed as above by systems FI of degree 
at most m > 1, with permitting conditions of type a, and forbidding conditions 
of type /3, is denoted by RPm{oi, (3), a,/3 G {empty, symb, patt} U {subk, prefk, 
suffk I k> 1}. When we will use both prefix and suffix checking (each condition 
string can be checked both as a prefix or as a suffix, that is, we do not separately 
give sets of prefixes and sets of suffixes), then we indicate this by prefsuffk. If 
the degree of the systems is not bounded, then the subscript m is replaced by *. 

Proofs of the following results can be found in ^j. Again, a clear trade- 
off between the number of membranes and the power of the communication 
conditions is found. We do not know whether or not these results are optimal 
(for instance, as the number of used membranes); in particular, we do not have a 
proof of the fact that a reduced number of membranes suffice also when checking 
prefixes and suffixes, but we conjecture that such a result holds true. 
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Theorem 7. RE = RP2{patt, empty) = RP2 {empty , sub2) = RP4{sub2, symb) 
= RPQ{symb, empty) = RP^t{prefsuff2, empty). 

4.5 P Systems with Leftmost Rewriting 

Following 0, we now consider a restriction in the use of rules of a rewriting P 
system, of a language-theoretic nature: any string is rewritten in the leftmost 
position which can be rewritten by a rule from its region. That is, we examine 
the symbols of the string, step by step, from left to right; the first one which 
can be rewritten by a rule from the region of the string is rewritten. If there are 
several rules with the same left hand symbol, any one is chosen. 

We denote by Lieft{n) the language generated by a system II in this way 
and by RPm{left), m > 1, we denote the family of all such languages, generated 
by systems with at most m membranes. In view of the previous results, the 
following theorem is expected (but whether or not it is optimal in the number 
of membranes remains as an open problem): 

Theorem 8. RE = RPQ{left). 

4.6 P Systems with Worm-Objects 

In P systems with symbol-objects we work with multisets and the result of a 
computation is a vector of natural numbers; in the case of string-object P systems 
we work with sets of strings and the result of a computation is a string. We can 
combine the two ideas: we can work with multisets of strings and consider as 
the result of a computation the number of strings sent out during a halting 
computation. To this aim, we need operations with strings which can increase 
and decrease the number of occurrences of strings. 

The following four operations were considered in PI (they are slight variants 
of the operations used in |^): rewriting (called point mutation in and IS5I1. 
replication (as in Subsection 4.3, but always producing only two strings), splitting 
(if a G P and ui,U2 G P’*’, then r : a — >■ ui|u2 is called a splitting rule and we 
define the operation x\ax2 =>r {x\Ui^U2X2)), recombination /crossovering (for 
a string z G P“*" we define the operation {xizx2,y\zy2) =^z {xizy2TyizX2))- 
Note that replication and splitting increase the number of strings, mutation 
and recombination not; by sending strings out of the system, their number can 
also be decreased. 

A P system (of degree m > 1) with worm-objects is a construct 

n = (P, /i, Ai, . . . , Am, {Rl, Si, Ml, Cl), ... , {Rm, Sm, Mm, Cm)), 



where: 

- P is an alphabet; 

- /r is a membrane structure of degree m (with the membranes labeled by 
1,2 , ...,m); 
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- Ai,. . . ,Am are multisets of a finite support over V*, associated with the 
regions of /i; 

- for each 1 < i < m, Ri, Si, Mi, Ci are finite sets of replication rules, splitting 
rules, mutation rules, and crossing-over blocks, respectively, given in the 
following forms: 

a. replication rules: (a — >■ Ui\\u 2 ',tari,tar 2 ), for tar i, tar 2 € {here, out} U 
{iUj \ l<j<m}; 

b. splitting rules: (a — >■ tii|u 2 ; tori, tor 2 ), for tari,tar 2 € {here, out} U 
{irij I 1 < j < m}; 

c. mutation rules: (a — >■ u; tar), for tar € {here, out} U {inj | 1 < j < m}; 

d. crossing-over blocks: {z\tari,tar 2 ), for tar\,tar 2 G {here, out} VJ {in j \ 

1 < J < to}. 

The transitions are defined as usual in P systems area, according to the 
following specific rules: A string which enters an operation is “consumed” by 
that operation, its multiplicity is decreased by one. The multiplicity of strings 
produced by an operation is accordingly increased. A string is processed by only 
one operation. For instance, we cannot apply two mutation rules, or a mutation 
rule and a replication one, to the same string. The strings resulting from an 
operation are communicated to the region specified by the target indications 
associated with the used rule. (Note that when we have two resulting strings, 
two targets are associated with the rule.) 

The result of a halting computation consists of the number of strings sent 
out of the system during the computation. A non-halting computation provides 
no output. For a system U, we denote by N{II) the set of numbers computed 
in this way. By NWPm{tar),m > 1, we denote the sets of numbers computed 
by all P systems with at most to membranes. 

In PI it is proved that each recursively enumerable set of natural numbers 
(their family is denoted by nRE) can be computed by a P system as above; the 
result is improved in 1231, where it is shown that the hierarchy on the number 
of membranes collapses: 

Theorem 9. nRE = NWPe{tar). 

It is an open problem whether or not the bound 6 in this theorem can be 
improved; we expect a positive answer. 

Note the resemblance of P systems with worm objects with P systems with 
rewriting and replication (also Theorems 6 and 9 are similar), but also the es- 
sential difference between them: in Subsection 4.3 we have used sets of strings, 
while here we have worked with multisets (and we have generated sets of vec- 
tors of natural numbers). Following the same strategy as in the case of using 
both rewriting and replication, we can consider also other combinations of op- 
erations from those used in the case of worm-objects. The case of rewriting and 
crossovering was investigated in 1221. The work of such a P system is exactly as 
the work of a P system with worm-objects, only the way of defining the result 
of a computation is different: we consider the language of all strings which leave 
the system during the halting computations. 
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Let us denote by RXPm{i/o),m > 1, the family of languages generated by 
such systems with at most m membranes, using as communication commands the 
indications here, out, in (but not priorities and actions <5, t). Somewhat expected, 
we get one further characterization of recursively enumerable languages (the 
proof can be found in El). 

Theorem 10. RE = RXP^{i/o). 

It is an open problem whether or not the bound 5 is optimal. 



5 P Systems with Active Membranes 

Let us now consider P systems where the membranes themselves are involved in 
rules. Such systems were introduced in inoi with also the possibility of dividing 
membranes, and their universaity was proved for the general case. However, 
in El it was proven that membrane division is not necessary. Here we will 
consider this restricted case. (However, membrane division is crucial in solving 
NP-complete problems in polynomial - even linear - time, by making use of an 
exponential working space created by membrane division, but we do not deal 
here with this aspect. A survey of results of this type can be found in m) 

A P system with active membranes, in the restricted form, is a construct 
n = {V,T, H,fJ.,Wi,...,Wm,R), 



where: 

(il m > 1; 

(ii) V is the alphabet of the system; 

(iii) T CV (terminal alphabet); 

(iv) H is a, finite set of labels for membranes; 

(v) /r is a membrane structure, consisting of m membranes labeled with ele- 
ments of H and having a neutral charge (all membranes are marked with 
0 ); 

(vi) wi, ... , Wm are strings over V, describing the multisets of objects placed in 
the m regions of /i; 

(vii) i? is a finite set of rules, of the following forms: 

(a) [^a v]2, ior h £ H, a £ V, v £ V*, a £ {-I-,— ,0} (object evolution 

rules), 

(b) — >■ where a,b £ V, h £ H, a,P £ {-I-,— ,0} (an object is 

introduced in membrane h), 

(c) [^a ] 2 — t [/,] ^b, for h £ H, a,(3 £ {-I-, — , 0}, a,b £V (an object is sent out 
of membrane h) . 

Also rules for dissolving and for dividing a membrane are considered in 
(and in other subsequent papers), but we do not use such rules here. 

The rules are used as customary in a P system, in a maximally parallel 
manner: in each time unit, all objects which can evolve, have to evolve. Each 
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copy of an object and each copy of a membrane can be used by only one rule, 
with the exception of rules of types (a), where we count only the involved object, 
not also the membrane. That is, if we have several objects a in a membrane i 
and a rule [^a — >■ then we use this rule for all copies of a, irrespective how 
many they are; we do not consider that the membrane was used ~ note that its 
electrical charge is not changed. However, if we have a rule [jO]“ — ?> ]^6, then 

this counts as using the membrane, no other rule of types (b) and (c) which 
involves the same membrane can be used at the same time. 

As any other membrane, the skin membrane can be “electrically charged” . 
During a computation, objects can leave the skin membrane (using rules of type 

(c)). 

We denote by N{II) the set of all vectors of natural numbers computed 
as above by a P system II. The family of all such sets of vectors, computed 
by systems with at most m > 1 membranes, is denoted by PsAPm] when the 
number of membranes is not restricted, we replace the subscript m by *. 

As announced above, in it is proved that PsRE = PsAP^, and the 
problem is formulated whether or not the hierarchy on the number of membranes 
collapses at a reasonable level. In H2I it is proved that four membranes suffice 
(it is an open problem whether or not this result is optimal). 

Theorem 11. PsRE = PSAP 4 . 

6 Techniques Used in the Proofs of Universality Results 

The most used tool (almost always used when dealing with P systems with 
symbol-objects) for proving universality results is the characterization of re- 
cursively enumerable languages by means of matrix grammars with appear- 
ance checking in the binary normal form. Such a grammar is a construct 
G = {N,T, S, M, F), where N,T are disjoint alphabets, S G N, M is a, finite 
set of sequences of the form (Ai — >• xi, . . . , A„ — >• a;„), n > 1, of context-free 
rules over U T (with Ai G N,Xi G {N U T)*, in all cases), and F is a set 
of occurrences of rules in M (A^ is the nonterminal alphabet, T is the terminal 
alphabet, S is the axiom, while the elements of M are called matrices). 

For w,z G {N U T)* we write w z if there is a matrix {A\ — > X\^ 
. . . , An — >■ Xn) in M and the strings Wi G {N UT)*,l<f<n-|-l, such that w = 
Wi,z = Wn-i-i, and, for all 1 < z < n, either (1) Wi = w'^Aiw” ,Wi+i = w'^Xiw", 
for some rc', w'/ G (A^UT)*, or (2) Wi = Wi+i, Ai does not appear in Wi, and the 
rule Ai — >■ Xi appears in F. (The rules of a matrix are applied in order, possibly 
skipping the rules in F if they cannot be applied ~ we say that these rules are 
applied in the appearance checking mode.) 

The language generated by G is defined by L{G) = {ic G T* | F =>* w}. 
The family of languages of this form is denoted by MATac- When F = % (hence 
we do not use the appearance checking feature), the generated family is denoted 
by MAT. 

It is known that CF C MAT G1 MATac = RE, the inclusions being proper. 
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A matrix grammar G = (A^, T, 5", M, F) is said to be in the binary normal 
form if N = NiU N 2 U {S', ff}, with these three sets mutually disjoint, and the 
matrices in M are of one of the following forms: 

1. {S ^ XA),with X G Ni,Ag N 2 , 

2. {X ^Y,A^ x),with X,Y G Ni,Ag N 2 ,x G {N 2 UT)*, 

3. (A ^ #), with A,r G lVi,A G A^ 2 , 

4. (A — >■ A, A — >■ x), with A G Ai, A G N 2 , and x GT* . 

Moreover, there is only one matrix of type 1 and F consists exactly of all rules 
A ^ ff appearing in matrices of type 3; # is a trap-symbol, once introduced, 
it is never removed. A matrix of type 4 is used only once, in the last step of a 
derivation. 

According to 0, for each matrix grammar there is an equivalent matrix 
grammar in the binary normal form. 

For an arbitrary matrix grammar G = (A, T, S, M, F), let us denote by ac{G) 
the cardinality of the set (A G N \ A ^ a G F}. ^From the construction in the 
proof of Lemma 1.3.7 in jS| one can see that if we start from a matrix grammar 
G and we get the grammar G' in the binary normal form, then ac(G') = ac{G). 

Improving the result from m ( six nonterminals, all of them used in the ap- 
pearance checking mode, suffice in order to characterize RE with matrix gram- 
mars), in m it was proved that four nonterminals are sufficient in order to 
characterize RE by matrix grammars and out of them only three are used in 
appearance checking rules. Of interest in the P area is another result from IS]: if 
the total number of nonterminals is not restricted, then each recursively enumer- 
able language can be generated by a matrix grammar G such that ac{G) < 2. 

Consequently, to the properties of a grammar G in the binary normal form 
we can add the fact that ac(G) < 2. We will say that this is the strong binary 
normal form for matrix grammars. 

Starting from such a grammar G, in many cases a P system is constructed 
with some membranes simulating the matrices from G which do not contain rules 
to be used in the appearance checking mode, and some membranes associated 
with the symbols which are checked in the matrices which contain rules used in 
the appearance checking mode (that is why the number of nonterminals A which 
appear in rules of the form A — # is so important). This leads to a reduced 
number of membranes, as seen in the theorems from the previous sections. 

Other very useful tools used mainly in the proofs dealing with string-objects 
are the normal forms of Chomsky grammars known to generate all recursively 
enumerable languages. The most important are the Kuroda normal form and 
the Geffert normal form. 

A type-0 grammar G = (A, T, S, P) is said to be in the Kuroda normal form 
if the rules from P are of one of the following two forms: A — >■ x,AB GD, 
for A^B^G^D G A and x G (A U T)* (that is, besides context-free rules we have 
only rules which replace two nonterminals by two nonterminals). 

A type-0 grammar G = (A, T, S', P) is said to be in the Geffert normal form 
if A = {S, A, B, C}, and the rules from P are of one of the following two forms: 
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S — >■ xSy, with x,y & {TU{A, B,C})* , and ABC — >■ A (that is, besides context- 
free rules we have only one non-context-free rule, the erasing one ABC — >■ A, 
with A, B, C being the only nonterminals of G different from S). 

At a more technical level, the proofs of universality start by a matrix gram- 
mar or a grammar in Kuroda or Gefffert normal form, and construct a P system 
which simulates it; the control of the correct behavior of the system is ensured 
by various tricks, directly related to the type of the system we construct. In most 
cases, because only halting computations are accepted, a “wrong” choice of rules 
or of communication of objects is prevented by introducing trap-symbols which 
are able to evolve forever; in the string case, if a terminal alphabet is used, then 
one introduces trap-symbols which just prevent the string to become terminal. 
Another useful trick is to control the communication possibilities; if a rule in- 
troduces an object which must be communicated to a lower membrane, then 
the communication should be possible (for instance, the thickness of the lower 
membranes is not increased). A nice way to control the appearance of a symbol 
to be checked by a rule of a matrix grammar is provided by replication: a copy of 
the string is sent to a membrane where nothing happens except that the compu- 
tation goes forever in the case when the specified symbol appears in the string; 
if this is not the case, then the other copy of the string will continue a “correct” 
development. When we have to work on strings in the leftmost/rightmost man- 
ner, then the rotate-and-simulate technique from the splicing area is useful: we 
simulate the rules of the starting grammar in the ends of the strings generated 
by a P system, and the strings are circularly permuted in order to make possible 
such a simulation in any place of a sentential form of the grammar. 

Still more precise proof techniques (for instance, for synchronizing the work 
of different membranes) can be found in the literature, but we do not recall them 
here. 



7 Final Remarks 

We have considered most of the variants of P systems with symbol-objects and 
with string-objects and we have recalled characterizations of the family of re- 
cursively enumerable sets of vectors of natural numbers or of the family of re- 
cursively enumerable languages. Many other variants, several of them leading to 
similar results, can be found in the literature. We only mention here the gener- 
alized P systems considered in @, cni, nn, the carrier-based systems (where 
no object evolves, but only passes through membranes) m. the P systems with 
valuations m (their power is not know yet), the systems also able to create 
membranes ca, which lead to a characterization of Parikh sets of ETOL lan- 
guages, and the systems also taking into account the energy created/consumed 
by each evolution rule 1^ . 

Note. The work of the second author was supported by a grant of NATO Science 
Committee, Spain, 2000-2001. 
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Abstract. Reversible computing is a paradigm of computation that re- 
flects physical reversibility, and is considered to be important when de- 
signing a logical devices based on microscopic physical law in the near 
future. In this paper, we focus on a problem how universal computers 
can be built from primitive elements with very simple reversible rules. 
We introduce a new reversible logic element called a “rotary element”, 
and show that any reversible Turing machine can be realized as a circuit 
composed only of them. Such reversible circuits work in a very differ- 
ent fashion from conventional ones. We also discuss a simple reversible 
cellular automaton in which a rotary element can be implemented. 



1 Introduction 

Recently, various computing models that directly reflect laws of Nature have 
been proposed and investigated. They are quantum computing (e.g., |‘2f4j ) . DNA 
computing (e.g., m, reversible computing (e.g., HEEI), and so on. Reversible 
computing is a model reflecting physical reversibility, and has been known to play 
an important role when studying inevitable power dissipation in a computing 
process. Until now, several reversible systems such as reversible Turing machines, 
reversible cellular automata, and reversible logic circuits have been investigated. 

A logic gate is called reversible if its logical function is one-to-one. A reversible 
logic circuit is a one constructed only of reversible gates. There are “universal” 
reversible gates in the sense that any logic circuit (even if it is irreversible) can 
be embedded in a circuit composed only of them. A Fredkin gate [3j and a Toffoli 
gate m are typical universal reversible gates having 3 inputs and 3 outputs. 

A rotary element (RE), which is also a reversible logic element, was intro- 
duced by Morita, Tojima, and Imai 0. They proposed a simple model of 2-D 
reversible cellular automaton called P 4 in which any reversible two-counter ma- 
chine can be embedded in a finite configuration. They showed that an RE and 
a position marker as well as several kinds of signal routing (wiring) elements 
can be implemented in this cellular space, and gave a construction method of a 
reversible counter machine out of these elements. An RE is a 4-input 4-output 
reversible element, and has two states. It is a kind of switching element that 
changes the path of an input signal depending on its state. 
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In this paper, we show that any reversible Turing machine can be realized as 
a circuit composed only of REs in a systematic manner. In spite of the simplicity 
of an RE, the circuit obtained here is relatively concise and its operation is easy 
to be understood. Especially, there is no need to supply a clock signal to this 
circuit, and it contrasts sharply with a conventional logic circuit. We also discuss 
a simple reversible cellular automaton called P 3 proposed in m, in which an 
RE can be embedded. 

2 Preliminaries 

2.1 A Reversible Sequential Machine 

We first give a definition of a reversible sequential machine (RSM). As we shall 
see below, a reversible logic circuit composed of REs, as well as an RE itself, can 
be formulated as an RSM. 

A reversible sequential machine (RSM) is a system defined by 

M={Q,S,r,q,,S), 

where Q is a finite non-empty set of states, S and F are finite non-empty sets 
of input and output symbols, respectively, and gi £ Q is an initial state. S : 
QxE — >■ QxTisa one-to-one mapping called a move function (hence 1 17| < \F\). 

We can see that an RSM is “reversible” in the sense that, from the present 
state and the output of M, the previous state and the input are determined 
uniquely. A variation of an RSM M = {Q, S, r,S), where no initial state is 
specified, is also called an RSM for convenience. 

2.2 A Rotary Element and a Reversible Logic Circuit 

A rotary element (RE) is a logic element depicted in Fig. d It has four input 
lines {n, e, s,?n} and four output lines {n' ,e' , s\w'}, and has two states called 
H-state and V-state. All the values of inputs and outputs are either 0 or 1, i.e., 
{n,e, s,w),{n\e’ , s' ,w') £ {0,1}'^. However, we restrict the input (and output) 
domain as {(0, 0, 0, 0), (1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1)}, i.e., at most 
one “1” appears as an input (output) at a time. Hence, the operation of an RE 
is left undefined for the cases that signal I’s are given to two or more input lines. 
In order to explain its operation, we employ the following intuitive interpretation 
for it. Signals 1 and 0 are interpreted as existence and non-existence of a par- 
ticle. An RE has a “rotating bar” to control the moving direction of a particle. 
When no particle exists, nothing happens on the RE. If a particle comes from 
a direction parallel to the rotating bar, then it goes out from the output line 
of the opposite side (i.e., it goes straight ahead) without affecting the direction 
of the bar (Fig. El (a)). On the other hand, if a particle comes from a direction 
orthogonal to the bar, then it makes a right turn, and rotates the bar by 90 
degrees counterclockwise (Fig. El(b)). 

We can define an RE as an RSM Mre- Since the input (0,0, 0,0) has no 
effect on an RE, we omit it from the input alphabet for convenience. Further, 
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Fig. 1. Two states of a rotary element. 



t = 0 



t = 1 




(a) 



t = 0 



t = 1 




(b) 



Fig. 2. Operations of a rotary element: (a) the parallel case (i.e., the coming direction 
of a particle is parallel to the rotating bar), and (b) the orthogonal case. 



we denote the sets of input and output alphabets of Mre as {n,e,s,w} and 
{n', e', s', w'} instead of {(1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1)}. 

Mre is defined by 

MnE = ({H, |T|},{?^, e, s,w},{n',e',s',u;'},(5RE), 

where the move function ^re is shown in Table0](for instance, if the present state 
is 3 and a particle comes from the input line n, then the state becomes [T] and a 
particle goes out from w'). We can see that the operation of an RE is reversible. 
It has also a bit-conserving property, i.e., the number of I’s is conserved between 
inputs and outputs, since a particle is neither annihilated nor newly created. 



Table 1. The move function 5 rb of a rotary element Mre- 



Present state 


Input 


n 


e 


s 


w 


H-state: 0 


0rc' 


01/;' 


0 e' 


0e' 


V-state: 0 


0 s' 


0 n' 


0n' 


0 s' 



An RE-circuit is a one composed only of REs satisfying the following condi- 
tion: each output of an RE can be connected at most one input of some other 
(or may be the same) RE, i.e., “fan-out” of an output is not allowed. It is also 
easy to formulate each RE-circuit as an RSM. 
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2.3 Logical Universality of a Rotary Element 

It has been shown that a Fredkin gate can be realized by an RE circuit as in in 
FigOljOl. Since a Fredkin gate is logically universal an RE is also universal. 
However, in the following, we do not use this construction method. 




Fig. 3. Realization of a Fredkin gate as an RE circuit |H]. Small triangles are delay 
elements, where the number written inside of each triangle indicates its delay time. 
(Note that delay elements can also be implemented by using REs as in Fig|2Ia).) 



3 A Reversible Turing Machine Composed of REs 

3.1 A Reversible Turing Machine 

We first define a one-tape Turing machine and its reversible version. We use 
quadruple formalism PJ of Ei Turing machine in order to define a reversible one. 

Definition 1. A one-tape Turing machine (TM) is a system defined by 

T = (Q, 5 ',( 7 o,g/,so,( 5 ), 

where Q is a non-empty finite set of states, S is a non-empty finite set of symbols, 
qo is an initial state (qo € Q), q/ is a final state (qf € Q), sq is a speeial blank 
symbol (sq S S), and S is a move relation which is a subset of {Q x S x S x Q) A 
(Qx{/}x{— , 0 , -b}x( 5 ). Each element of 5 is called a quadruple, and either of the 
form [qr,s,s',qt] G (QxSxSxQ) or [qr,/,d,qt] G (Q x {/} x {-, 0 , -b} x Q). The 
symbols “0”, and denote “left-shift”, “zero-shift”, and “right- shift”, 

respectively. [qr,s,s' ,qt] means that if T reads the symbol s in the state qr, then 
write s' and go to the state qt- [qr, /, d, qt] means that ifT is in the state qr, then 
shift the head to the direction d and go to the state qt- 

Let ai = [pi,bi,ci,p'i\ and 02 = [p2,^2,C2,py be two quadruples in S. We 
say ai and 02 overlap in domain iff 

Pi=P 2 A [ 5 i = 62 V bi = / V &2 = / ]• 

We say ai and 02 overlap in range iff 

Pi = P2 A [ Cl = C2 V bi = / V 62 = / ] • 
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A quadruple a is said to be deterministic (in 5) iff there is no other quadruple in S 
with which a overlaps in domain. On the other hand, a is said to be reversible (in 
S) iff there is no other quadruple in 6 with which a overlaps in range. T is called 
deterministic (reversible, respectively) iff every quadruple in S is deterministic 
(reversible). (In what follows, we consider only deterministic Turing machines.) 



Theorem 1. one-tape Turing machine, there is a reversible three- 

tape Turing machine which simulates the former. 



Theorem 2. |0| For any one-tape Turing machine, there is a reversible (semi- 
infinite) one-tape two-symbol Turing machine which simulates the former. 

Theorem n shows computation-universality of deterministic reversible three- 
tape Turing machines. Theorem |2| is useful for giving a simple construction 
method of a reversible TM out of REs. 



3.2 Constructing a Tape Unit 

In order to construct a reversible one-tape two-symbol Turing machine out of 
REs, we first deign a tape cell module (TC-module) as an RE-circuit. A TC- 
module simulates one tape square of a two-symbol reversible TM. It can store a 
symbol 0 or 1 written on the square and an information whether the tape head 
is on the square or not. It is formulated as an RSM Mxc defined below. 

Mtc = (QtC, -S’tC, ^TC, <5tc) 

Qtc = {{h,s)\h,s G {0,1}} 

Etc = {R, RcO, Rcl, W, Wc, SR, SRI, SRc, SL, SLI, SLc, EO, El, Ec} 

Ftc = {a;' I a; G Utc} 

(5tc is defined as follows, where s G (0, 1} and y G Etc ~ {SRI, SLI}-. 



<5tc((0,s),2/) 


= ((0,s),y') 


(1) 


<^Tc((0, s),SRI) 


= {{l,s),SRc') 


(2) 


<5tc((0, s),SLI) 


= {{l,s),SLc') 


(3) 


(5tc(( 1, 0), i?) 


= ((1,0), RcO') 


(4) 


<5tc((1, 1), 


= ((1,1), Rcl') 


(5) 


(5tc((1,0),1U) 


= ((l,l),lTc') 


(6) 


<5tc((1, 1), W) 


= ((l,0),lUc') 


(7) 


<5tc((1, s),SR) 


= {{Q,s),SRI') 


(8) 


5Tc{{hs),SL) 


= {{Q,s),SLI') 


(9) 


<5tc((1,0),E0) 


= ((l,0),Rc') 


(10) 


<5Tc((l,l),i^l) 


= ((l,l),Rc') 


(11) 



Mtc has the state set {{h, s) | h, s G (0, 1}}. The state {h, s) represents that 
the symbol s is written on the tape square, and that the tape head is on this 
cell (if h = 1) or not (if h = 0). There are 14 input and 14 output symbols. 
In the following construction of an RE-circuit, there are also 14 input and 14 
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Table 2. Fourteen input lines of an RSM Mtc- 



Signal Line 


Meaning of a Signal 


R 


Read the symbol at the head position. 


RcO 


A read operation is completed with the result 0. 


Rcl 


A read operation is completed with the result 1. 


W 


Write a complementary symbol at the head position (i.e., 
if the current symbol is 1, then write 0, else write 1). 


Wc 


A write operation is completed. 


SL 


Shift the head position to the left. 


SLI 


Set the head position at the cell immediately to the left. 


SLc 


A left-shift operation is completed. 


SR 


Shift the head position to the right. 


SRI 


Set the head position at the cell immediately to the right. 


SRc 


A right-shift operation is completed. 


EO 


Reversibly erase the information 0 kept by the finite-state 
control by referring the symbol 0 at the head position. 


El 


Reversibly erase the information 1 kept by the finite-state 
control by referring the symbol 1 at the head position. 


Ec 


A reversible erasure (i.e., merge) operation is completed. 



output lines corresponding to these symbols. The roles of the 14 input lines are 
shown in Table 0 To each input line x G Stc there corresponds an output line 
x' G Itc in the one-to-one manner. 

Mtc nets as follows according to the move function Jtc- 

(I) The case h = 0: (i) If a signal 1 arrives at the input line y G Stc ~ 
{SRI, SLI}, then it simply goes out from y' without affecting the state of Mtc 
by (1). (ii) If a signal arrives at the line SRI (or SLI, respectively), then the 
head position is set to this tape cell, and a completion (i.e., response) signal for 
the shift-right (shift-left) operation goes out from the line SRc' {SLc') by (2) 
(or (3)). 

(II) The case = 1: (i) If a signal 1 arrives at the line R and if s = 0 (or 
s = 1, respectively), then a response signal goes out from RcO' (Rcl) by (4) (or 
(5)), performing a read operation, (ii) If a signal arrives at the line W and if 
s = 0 (or s = 1, respectively), then s is set to 1 (or 0) and a response signal goes 
out from W' by (6) (or (7)), performing an operation of writing a complementary 
symbol, (iii) If a signal arrives at the line SR (or SL, respectively), then h is 
set to 0 and a response signal goes out from SRI' (or SLI') by (8) (or (9)). (iv) 
If a signal arrives at the line EQ (or El, respectively) and s = 0 (s = 1), then 
a response signal goes out from Ec' by (10) (or (11)), performing a “reversible 
erasure” of one bit of information by referring the symbol at the head position 
(usage of this operation is explained later). 

Fig. Elshows a TC-module, a realization of a tape cell Mtc as an RE-circuit. 
In order to explain the operations of a TC-module, it is convenient to consider 
an RE-column shown in Fig. 0(a). It consists of fc-|-l REs, and has 2k input lines 
and 2k output lines {k G {1,2, •••}). We assume all REs except the bottom one 
(indicated by x) are initially set to V-states (the bottom RE may be either H- or 
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V-state). Further assume a signal 1 is given to at most one input line. Then, an 
RE-column acts like an RSM shown in Fig. 0(b), where the input and output 
alphabets are {h, ■ ■ ■ ,lk,r\, ■ ■ ■ , r^} and {l[, ■ ■ ■ ■ ■ ■ , r^}, respectively, and 

the state set is {El, E} which matches that of the bottom RE. Though there 
are fc + 1 REs, we can consider the RE-column as if a two-state machine. Because 
all the REs except the bottom one are reset to V-states when a signal 1 goes 
out from some output line. For example, if it is in the state E the input is 
Ij, then after some time steps the state becomes E gives an output I'y In 
what follows, we write x = 1 (or marked) if it is E , x = 0 (or unmarked) if 

it is E • 




Fig. 4. A TC-module: a realization of a tape cell Mtc as an RE-circuit. 



A TC-module consists of two RE-columns, i.e., left and right ones which 
correspond to h and s respectively. We can verify that the TC-module acts as 
Mtc by testing all the cases of inputs and states. For example, consider the case 
that a signal is given to i?. If h = 0 then the signal eventually goes out from R' 
without affecting the states h and s.lf h = 1, the signal first sets the state h to 0, 
and then enters the third RE of the right RE-column. There are two sub-cases: 
s = 0 and s = 1. If s = 0, the signal goes out from the right side of the third RE 
without affecting s, and enters the fourth RE of the left RE-column. It then sets 
h to I, and finally goes out from RcO'. If s = I, the signal first goes out from the 
left side of the third RE setting s to 0, and enters the second RE of the right 
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Fig. 5. (a) An RE-column, and (b) its move function {j £ {1, ■ • ■ , fc}). 



RE-column. It restores both s and h to 1 and finally goes out from It is 
also easy to verify other cases. 

An entire circuit for a tape unit can be obtained by placing infinite num- 
ber of TC-modules in a row, and connecting input and output lines between 
adjacent TC-modules as shown in the right-half of Fig. El From the move func- 
tion of the TC-module, we can see that by giving a signal 1 to one of the lines 
R, W, SL, SR, EO, or El of the leftmost TC-module in the semi-infinite array, it 
can correctly simulate each operation on the tape unit. 



3.3 Constructing a Finite-State Control 

We now design an RE-circuit that simulates a finite-state control of a given one- 
tape two-symbol reversible Turing machine T. This circuit is called a finite-state 
control module (FC-module). It is constructed in a similar manner given in 0. 

Each quadruple of T performs either a read/write (i.e., R and/or W) opera- 
tion, or a head-shift {SL or SR) operation to a tape unit. In addition, in order to 
realize a FC-module as an RE-circuit, we need a “reversible erasure” operation 
that erases an information kept by the FC-module. This is due to the following 
reason. When a read operation is performed, the information of the read symbol 
(0 or 1) is distinguished by some different states of the FC-module of T. If such 
an information is never erased, then the total amount of the information grows 
indefinitely, and thus an FC-module needs infinite number of states. Hence, such 
information should be reversibly erased (i.e., merged) by referring the read sym- 
bol itself each time a read operation is performed. This operation is done by 
giving a signal to the line EQ or FI of the leftmost TC-module. 

To explain a construction method of an FC-module, we consider a simple 
example of a one-tape two-symbol reversible Turing machine 



T^n — ({ 91 , • • • , qie}, { 0 , 1 }, qi,qi6, 0 , S 2 n) 



110 



Kenichi Morita 



having the following quadruples as S2n- 



[gi, 0, 0, 52 ] 

[Q2, /) +) Qs] 
[ 93 , 0 , 0 , qie] 
[93, 1, 0, 54] 



[94, /, +, gs] 
[gs, 0 , 0 , ge] 
[95, 1 , 1 , 94] 

[< 76 , /, +, <77] 
[97, 0 , 1 , gs] 
[< 77 , 1 , 1 , < 76 ] 



[ 98 , /, +, 99] 
[99, 0 , 1 , 9 io] 

[910, / , +, 9 ii] 
[911, 0 , 0 , 912] 



[912, / , — , 913] 
[913, 0 , 0 , gi 4 ] 
[913, 1 , 1 , 912] 

[914, / , — , 915] 
[915, 0 , 1 , g2] 
[915, 1 , 1 , 914] 



It is easy to verify that T2„ is reversible. It computes the function /(n) = 2 n for 
a unary input as shown in Fig. El 



t = 0 : t = 93 : 




Fig. 6 . Computing the function f{n) = 2 n by a reversible Turing machine T2n- 



An FC-module for T2„ is shown in Fig.O The quadruples of T2„ are executed 
and controlled by a matrix of REs. Namely, each column corresponds to each 
state of T2n, and five rows correspond to five operations of write, read, shift- 
right, shift-left, and reversible erasure. At first, all the REs of the FC-module 
are in H-states. 




Fig. 7. The finite-state control module (FC-module) for T2n- 
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The FC-module executes an operation as follows. First, consider the case of 
a shift-right operation. If the present state of T 2 „ is, e.g., qs, then a signal 1 is 
put on the line qs in the lower part of Fig. 0 The signal goes upward to the RE 
on the row of a shift-right operation. After changing the RE to the V-state, the 
signal turns right and goes out from SR' . If a signal returns back from SRc, it 
restores the state of the RE to the H-state, and goes upward on the column. 

Next, consider the case of a read operation. For example, if the present state 
is qis, a signal 1 goes upward along the line up to the RE on the row of a read 
operation. Changing the RE to the V-state as in the previous case, it turns right 
and goes out from R' . A signal will return back from RcO or i?cl . It then restores 
the state of the RE, and goes upward taking a different path depending on the 
completion signal RcO or Rcl. 

Generally, just after a read operation, a reversible erasure operation should 
be performed. It is in fact an inverse of a read operation. The bottom row of 
REs shown in Fig. 0 realizes this operation. 

Appropriately connecting the vertical lines in the upper part of the figure to 
the ones in the lower part, state transition of T 2 „ is also realized. Further, the 
input line “Begin” is connected to the line of the initial state qi , and the line of 
the final state qie is to the output line “End” . (From the method given in we 
can assume that an initial state of a constructed reversible TM does not appear 
as the fourth element of a quadruple (hence it appears only at time 0)). 

3.4 An RE-circuit Realizing an RTM 

By connecting an FC-module with the tape module appropriately, we can obtain 
the whole circuit of a given reversible Turing machine. Fig. 0shows the RE-circuit 
for T 2 n- By giving a signal 1 to the “Begin” input it starts to compute. 




Fig. 8. The whole RE-circuit realizing T 2 „. 
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4 A Simple Universal Reversible Cellular Automaton 

In pm, a S'^-state reversible partitioned cellular automaton (RPCA) P 3 was 
proposed, and it was shown that any reversible counter machine can be embedded 
in the P 3 space. This improves the previous result in 0 on the number of states. 
Each cell of P 3 has four parts, and each part has a state set {0,1,2}^ (0, 1 
and 2 are indicated by a blank, O and •). Its local transition function is shown 
in Fig. 0 Reversibility of P 3 is verified by checking there is no pair of distinct 
rules having the same right-hand side. Fig. cni shows an RE embedded in P 3 
space, where a single • acts as a signal. Hence, the function of an RE can be 
decomposed into much simpler reversible local rules of P 3 . 




Fig. 9. The set of 13 rule schemes (representing 81 rules) of the rotation-symmetric 3“*- 
state RPCA P 3 . Each rule scheme of (a)-(l) stands for four rules obtained by rotating 
the left- and right-hand sides of it by 0, 90, 180 and 270 degrees. The rule scheme (m) 
represents 33 rules not specified by (a)-(l) {w,x,y,z Gjblank, O, •}). 

(a) 



t = 0 t = l t = 2 t = 3 t = 4 t = 5 I 




t = 0 t = l t = 2 t = 3 t = 4 t = 5 
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5 Concluding Remarks 



In this paper, we showed that any reversible Turing machine can be realized as a 
circuit composed only of rotary elements. If we use logic gates such as AND, OR, 
Fredkin gate, or Toffoli gate, some synchronization mechanism should be pro- 
vided to make two or more incoming signals interact properly. In a conventional 
circuit, it is solved by giving a clock signal. However, in the case of a rotary 
element, there is only one incoming signal to each element, which interacts with 
rotating bar of the element. Therefore it acts in a somewhat asynchronous man- 
ner, and thus there is no need to provide a clock signal. Hence, construction of 
a whole circuit becomes relatively simple. 

There have been known several simple universal models of reversible cellular 
automata (e.g., [5171X110) 1. For example, a Fredkin gate can be embedded in 
the cellular spaces in the references jS| and (Z). Though these models have very 
small number of states, they need an infinite configuaraion to realize a universal 
computer. On the other hand, in the models proposed in a reversible 

counter machine can be realized as a finite configuration by using rotary elements 
and some other elements implemented in these cellular spaces. It is left for the 
future study whether there are still other methods to implement a universal 
computer in a simple reversible cellular space, and whether further simplification 
of a rotary element is possible. 
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Abstract. The equivalence problem for deterministic pushdown auto- 
mata has been shown decidable in ISenQTdSenQTalSenQTbfjenOlhstin!^ . 
We give some applications of this decidability result to other problems 
arising in the following areas of theoretical computer science: 

— programming languages theory 

— infinite graph theory 

— Thue-systems 



1 Introduction 

We describe, in this extended abstract, several applications of the decidability 
of the equivalence problem for dpda (we denote this problem by Eq(D,D) in the 
sequel), in the following areas 

— programming languages theory 

— infinite graph theory 

— Thue-systems 

We have left out of this description applications to other equivalence problems 
in formal language theory; such applications are already described in [SenOll 
p.155-156]. 

What we call an application may be either a true application of theorem Q or a 
result obtained by a method which is a variant of the method used in ySen()l| . In 
many cases the “application” to a problem P will follow from a reduction from 
problem P to Eq(D,D) which already existed in the literature (i.e. the reduction 
was established before Eq(D,D) was solved). In some cases (corollary 0 theorem 
m theorem cnj we shall give a new reduction of some problem P to Eq(D,D). 
In all cases we try to sketch the landscape around every application. 

2 Deterministic Pushdown Automata 

In this section we just recall the basic definitions on dpda and the result about 
decidability of the equivalence problem. More details about either the proof of 
this result or the historical development of the works around this problem can be 
found elsewhere (see |Sen97aj for the solution, section 1 . 1] for a general 

historical outline and |SenflOaj for an historical perspective on the proof-systems 
used in the solution). 



M. Margenstern and Y. Rogozhin (Eds.): MCU 2001, LNCS 2055, pp. 2001. 

(c) Springer- Verlag Berlin Heidelberg 2001 
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2.1 Pushdown Automata 

A pushdown automaton on the alphabet A is a 7-tuple Ai=<X, Z, Q, S, qo^zo, F> 
where Z is the finite stack-alphabet, Q is the finite set of states, go G Q is 
the initial state, zq is the initial stack-symbol, P is a finite subset of QZ*, 
the set of final configurations, and <5, the transition function, is a mapping S : 
QZx {XU{e}) ^VfiQZ*). 

Let ( 7 , g' G Q,uj,uj' G Z*,z G Z,f G X* and a G A U {e} ; we note 
{qzLo^af) I — {q'^'^if) if q'^' G d{qz,a). i — is the reflexive and tran- 
sitive closure of i — ■ 

f * 

For every quj, q'uj' G QZ* and / G A*, we note quo — q'^' iff /) ' — 
(gV,e). 

Xi is said deterministic iff, for every z G Z,q G Q,x G X : 

Card(i5(g2;,e)) G {0,1} (1) 

Card((5(g0, e)) = 1 Card(5(gz, x)) = 0, (2) 

Card(<5(g2:, e)) = 0 ^ Card((5(gz, a;)) < 1. (3) 

A configuration qoj of Xi is said e-bound iff there exists a configuration q'oj' such 
that (gw, e) i — >m e); q'^ is said e-free iff it is not e-bound. 

A pda M is said normalized iff, it fulfills conditions (see above) and 

( 111 , 0 ,( 0 : 



gozo is e — free 


(4) 


and for every qGQ,zGZ,xGX\ 




q'uj' G 5{qz,x) =>| oj' \< 2, 


(5) 


q'uj' G 5{qz,e) =>| uj' \= 0. 


(6) 



The language recognized by M. is 

L(A1) = {w G A* I 3c G F, gozo —^M 4- 

We call mode every element of QZ U (ej. For every q G Q,z G Z, qz is said 
e-bound (respectively e-free) iff condition 0 (resp. condition 0 ) in the above 
definition of deterministic automata is realized. The mode e is said e-free. We 
define a mapping /r : QZ* — ^ QZ U (ej by 

7x(e) = e and p,{qz ■ oS) = gz, 

for every q G Q, z G Z,uj G Z* . For every c G QZ* U |e}, fj,{c) is called the mode 
of c. The configuration c is said e-bound (resp. e-free) if and only if /r(c) has the 
corresponding property. 
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2.2 The Equivalence Problem 

The equivalence problem for dpda was raised in ESSE]. It is the following deci- 
sion problem: 

INSTANCE: Two dpda A^B, over the same terminal alphabet X. 
QUESTION: L(A) =L{B)7 



Theorem 1. The equivalence problem for dpda is decidable. 



The result is exposed in 

some simplifications. 



«SRUWS emikEi 



and proved in |S^n<l7hlS.^n01j , see in 



3 Programming Languages 



3.1 Semantics 



The equivalence problem for program schemes 

Let us say that two programs P, Q are equivalent iff, on every given input, 
either they both diverge or they both converge and compute the same re- 
sult. It would be highly desirable to find an algorithm deciding this equiva- 
lence between programs since, if we consider that P is really a program and 
Q is a specification, this algorithm would be a “universal program-prover” . 
Unfortunately one can easily see that, as soon as the programs P, Q com- 
pute on a sufficiently rich structure (for example the ring of integers), this no- 
tion of equivalence is undecidable. Nevertheless, this seemingly hopeless dream 
lead many authors to analyze the reason why this problem is undecidable and 
the suitable restrictions (either on the shape of programs or on the mean- 
ing of the basic operations they can perform) which might make this equiv- 
alence decidable. Informally, one can define an interpretation as an “universe 
of objects together with a certain definite meaning for each program prim- 
itive as a function on this universe “ and a program scheme as a “program 
without interpretation” ( jMil70l p.205,lines 5-13]). Several precise mathemati- 
cal notions of “interpretation” and “program schemes” were given and studied 



(|IiSS|,Eiini,EiSZ|, 



ESIZH3,|ISiIZHEl,inni ■ see for a survey) . Many methods for either 

transforming programs or for proving properties of programs were established 
but, concerning the equivalence problem, the results turned out to be mostly 
negative: for example, in IIUP70I p.221, lines 24-26], the authors report that 
“for almost any reasonable notion of equivalence between computer programs, 
the two questions of equivalence and nonequivalence of pairs of schemas are not 
partially decidable”. Nevertheless, two kinds of program schemes survived all 
these studies: 



— the monadic recursion schemes where a special ternary function if-then-else 
has the fixed usual interpretation: in the equivalence-problem for such 

schemes is reduced to the equivalence problem for dpda and in FiTTj a re- 
duction in the opposite direction is constructed, 
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— the recursive polyadic program schemes: in |Coii78alCoii78b|. following a rep- 
resentation principle introduced in |Kos75| . the equivalence-problem for such 
schemes is reduced to the equivalence problem for dpda and conversely. 

Corollary 2. The equivalence problem for monadic recursion schemes (with in- 
terpreted if-then-else^, is decidable. 

This follows from theorem D and the reduction given in mnai. 

Corollary 3. The equivalence problem for recursive polyadic program schemes 
(with completely uninterpreted function symbols) is decidable. 

This follows from theorem Hand the reduction given in pc ;oii78al theorem 3.25] 
or the reduction given in !™il corollary 4.4]. 

Interpretation by trees Let us precise now what a “recursive polyadic” program 
scheme is. Let {F, p) be some ranked alphabet i.e. a set F and a map p : F ^ IN. 
A tree over the ranked alphabet (F, p) is a partial mapping t : (IN — {0})* — l F 
satisfying the following conditions: 

Cl: dom(t) C (IN — {0})* is prefix-closed, i.e. if a-/3 G dom(f), then a G dom(t). 
C2: if p(t(a)) = k, then, for every i G IN, of G dom(t) ^ 1 < i < k. 

We denote by M(F) (resp. M°°(F)) the set of finite trees (resp. the set of 
all trees, finite or infinite) over F. With every symbol / of arity k = p{f) is 
associated a fc-ary operation / : M(F)^ — M(F) defined by /(ti , . ■ . ,tk) is the 
unique tree which has a root labelled by / and which has k subtrees at depth 1, 
the ith subtree being ti. 

A system of algebraic equations E over F is defined as follows: 
let <P = {ipi , . . . , ipn] be a ranked alphabet of “unknowns” (we denote by ki the 
arity of ipi) and let X = {a;i, . . . ,Xm} be an alphabet of “variables”, which are 
0-ary symbols, if is a set of n equations of the form 

(fii{xi,X2,...,Xki) =T^ (7) 

where every Ti is some element of M(F U d},X). A solution of A is a tuple 
(ti , . . . ,t„) G (M“(F U AT))" such that, for every 1 < i < n: 

ii{xi,X 2 , . ■ .,Xm) = fi{xi,X 2 , ■ ■ . , Xm) (8) 

where, ti is the operation obtained by interpreting every symbol / by the cor- 
responding operation / and Ti is the operation obtained by interpreting every 
symbol (pi (resp. /) by the corresponding operation ti (resp. /). E will be said 
in normal form iff every tree Ti has its root labelled by an element of F. In 
such a case E has a unique solution. A program scheme is a pair (E,T) where 
E is an algebraic system of equations over F and T is a particular finite tree 
T G M(F U X). The tree computed by the scheme is then 

t = T{xi,X2, . . .,Xm). 

What corollary 0 precisely means is that the following problem is decidable: 
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INSTANCE: Two program schemes (Ai, Ti), (A'2, T2) (assumed in normal form) 
QUESTION: Do these two schemes compute the same algebraic tree ? 

Let us describe now some other F-magmas which extend (M°“(F), (/) /^f)- 
It turns out that corollary 0 is true, as well, in these structures. 

Interpretation by formal power series 

The idea of this link with formal power series is due to iHina- Let F2 be the 

field with 2 elements (i.e. {ZZ, / 2 2 Z ,+,■)). Let D — F2[[Ai, A2, . . . , U]] be 

the set of formal power series with m + 1 commutative undeterminates and co- 
efficients in F2. We consider the two binary operations f,g over D: 

/(S', T) = Y ■ S'^ + Y^ g{S, T) = I + Y ■ + Y“^ ■ . 

Given an element S G D let us define the associated operation S : D™ D hy 

S{Si,S 2 ,...,S^)=So{Si,S 2 ,...,Sm,Y) 

i.e. the series obtained by substituting Si to the undeterminate Xi and leaving 
Y unchanged. 

Let A be a system of equations 0 over the ranked alphabet F = {/, 5}, with 
p{f) = p{g) = 2. A solution in D of (CJ is a n-tuple (Si, S2, . . . , S„) such that 
for every 1 < f < n: 

S,{X„X 2 ,...,X^)=T,{X„X 2 ,...,Xm) (9) 

where. Si is the operation defined above and Ti is the operation obtained by 
interpreting every symbol ipi (resp. /, g) by the corresponding operation Si (resp 

by f,g)- 

Let us define a map S : M°°{F, {x\,X2, ■ . ■ , Xm}) F2[[Xi,X2 , . . . , Xm, U]] 
by:for every ti,t2 G M°°{F,{xi,X2, ...,Xm}) 

S{xi) = Xp, S(/(ti,t2)) = /(S(ti),S(t2)); S{g{ti,t 2 )) = giS{ti),S{t 2 )). 

One can check that S is an injective homomorphism from the magma 
{M°° {F, {xi, X2, . ■ . , Xm}), f , g) into the magma {F2HX1, X2, . . . , Xm,Y]], f,g). 
Moreover the map S is compatible with substitution, in the following sense: 
Vt e M°°{F,{xi,X2, ■ . . ,Xm},V('Ui, . . .,Um) G (M°°(F, {xi,a;2, . . .,Xm})'^, 

S{i{ui, . . .,Um)) = S(t)(S(rti), . . .,S(Um))- 

(This last property follows from the fact that f,g were chosen to be polynomial 
operators). 

The fact that S is an homomorphism and that it is compatible with substitution 
show that S maps every solution of E in M°°{F,{x\,X2, ■ ■ ■ ,Xm}) to a solu- 
tion of S in F2 [[Ai, A2, . . . , Xm, U]]. The hypothesis that E is normal implies 
that it has a unique solution in F2[[Ai, A2, . . . , Xm, U]] too. In conclusion, two 
algebraic systems of equations Ei,E2 have the same solution (resp. the same 
first component of solution) in F2^X\,X2, ■ ■ ■ ,Xm,Y]] iff they have the same 
solution (resp. the same first component of solution) in M°°(F, A). 
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Corollary 4. The equality problem for series in F 2 HX 1 , X 2 , ■ ■ ■ , Xm,Y]] defined 
by an algebraic system of equations (with polynomial operators f,g and with 
substitution) is decidable. 

Let us notice that for any finite field Fq (with q = p’^ , p prime), one can introduce 
the polynomial operators: 

f{SuS 2 , ...,Sp) = f + YSf + ... + Y^SP + . . . + YPSP ( for / G Fq). 

By the same type of arguments one obtains 

Corollary 5. The equality problem for series in Fq[[Xi, X 2 , Xm,Y]] defined 
by an algebraic system of equations (with polynomial operators {f)f^p^ and with 
substitution) is decidable. 

3.2 Types 

Some works on recursively defined parametric types have raised the question 
whether it is possible to decide if two such types are equivalent or not. It has 
been shown in that this problem is reducible to the equivalence problem 

for dpda. Therefore we can state 

Corollary 6. The equivalence problem for recursively defined types is decidable. 



This follows from theorem Hand the reduction given in mB- 

4 Graphs 

We describe here several problems arising in the study of infinite graphs. Such 
infinite graphs arise naturally in computer science either as representing all the 
possible computations of a program (the infinite expansion of a recursive program 
scheme), or all the possible behaviours of a process, etc... They also arise in 
computational group theory as a geometrical view of the group (the so-called 
Cay ley-graph of the group) . 

When the chosen class of graphs is well-related with pushdown automata, one 
can draw from either theorem ^or its proof method some clue for solving decision 
problems on these graphs. 

4.1 Bisimulations-Homomorphisms 

Let X be a finite alphabet. We call graph over X any pair F = (Vr,Er) where 
Vr is a set and Ep is a subset of Vp x X x Vp. For every integer n G IN, we 
call an n-graph every (n -|- 2)-tuple F = {Vp, Ep, vi,. . . , Vn) where {Vp, Ep) is 
a graph and (ui, . . . ,u„) is a sequence of distinguished vertices: they are called 
the sources of F. 

A 1-graph {V,E,vi) is said to be rooted iff v\ is a root of (V,E). Let F,F' be 
two n-graphs over X. 
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Definition 7 TZ is a simulation from F to F' iff 

F dom(7?.) = Vr, 

2. Vi e e TZ, 

3. \/v,w € Vr,v' G Vr>,x € X, such that (v,x,w) € Ep and vTZv' , 

3w' G Vp' such that (v',x,w') G Epf and wTZw' . 

TZ is a bisimulation iff both TZ and TZ~^ are simulations. 

In the case where n = 0 this definition coincides with the classical one from 

EEHii, ima- 

Definition 8 We eall homorphism from F to F' any mapping: h : Vp ^ Vp/ 
such that h is a bisimulation in the sense of definitional^ 
h is called an isomorphism iff h is a bijective bisimulation. 



4.2 Some Classes of Infinite Graphs 






formal definitions 



Equational Graphs. The reader can find in 
of what an equational graph is. However, theorem 0 below gives a strong link 
with pda. 

We call transition- graph of a pda M, denoted T{M), the 0-graph: 

T{M) = {Vr(M),ET{M)) where Vp(M) = | g G Q, w G gw is e - free} 

and 

Et[A) = {{c,X,c) G Vr{M) X ^TiM) I C %M c'j. (10) 



We call computation Fgraph of the pda AI, denoted {C{A4),vm), the subgraph 
of T(AI) induced by the set of vertices which are accessible from the vertex qozo, 
together with the source v_m = qoZg. 



Theorem 9. Let F = (Fq,vq) be a rooted Fgraph over X. The following con- 
ditions are equivalent: 



F F is equational and has finite out-degree. 

2. F is isomorphic to the eomputation Fgraph {C{M.),vm) of some normalized 
pushdown automaton A4. 



A proof of this theorem is sketched in fSen00b| Annex, p. 94-96]. 



Algebraic Trees. Let {F, p) be some ranked alphabet i.e. a set E and a map 
p : F — >■ IN. The notion of a tree over the ranked alphabet (F, p) has been recalled 
in With such a tree one associates a new alphabet 

y = {[fM\f^F,i<k<p{f)}. ( 11 ) 

Let a G dom(t) : a = iii 2 . . - in- The word associated with a is : brch{a) = 
[/oj *i][/i 1 * 2 ] • ■ • [/n-i) in] where aj is the prefix of a of length j and fj = t{aj). 
The branch-language associated to t is then: 



Brch{t) = {brch{a) \ a G dom(t)}. 





Some Applications of the Decidability of DPDA’s Equivalence 121 



One can easily check that, if t, t' are trees over F, with no leaf, then t, t' are 
isomorphic iff Brch(t) = Brch{t'). The notion of algebraic tree over F can be 
defined in terms of algebraic equations in the magma of (infinite) trees over F\ 
t G M{F) is algebraic iff there exists some normal system of n equations S 
over F (see equation o in in such that, t is the first component of its unique 
solution. Anyway, the following characterization is sufficient for our purpose 

Theorem 10. Let t be a tree over the ranked alphabet (F,p), without any leaf, 
t is algebraic iff the language Brchft) is deterministic context-free. 

A more general version valid even for trees having some leaves is given in 
Theorem 5.5.1, point 2]. The trees considered above can be named ordered trees 
because the sons of every vertex a are given with a fixed order: al, o2, . . . , ak. 
We call unordered tree the oriented graph un(t), with vertices labelled over 
F and unlabelled edges, obtained by forgetting this ordering of the sons in an 
ordered tree t. An algebraic unordered tree is then a tree of the form un{f) for 
some algebraic ordered tree t. 



Automatic Graphs. The general idea underlying the notion of an automatic 
graph is that of a graph whose set of vertices is fully described sy some rational 
set of words and whose set of edges is fully described by some rational set of 
pairs of words. Several precise technical definitions following this general scheme 
have been studied in ISen92IKlN95IEel97ICS99IBG(J01Vlor(J0ILv0(Jb|Ly0(J^ . This 
idea appeared first in the context of group theory in [KCH+92] . 



Definition 1. A deterministic 2-tape finite automaton (abbreviated 2-d.f.a. in 
the sequel) is a 5-tuple: 

M=<X,Q,S,qo,F> 

where 

— X is a finite alphabet, the input- alphabet 

— Q is a finite set, the set of states 

— qo is a distinguished state , the initial state 

— F Q Q is a set of distinguished states , the final states 

— 6, the transition function, is a partial map from Q x (AU{#,e})^ to Q 
(where is a new letter not in X) fulfilling the restrictions: 

1. 'iqG Q,S{q,e,e) is undefined, 

2. 'iq G Q,Vm G (A" U {#})^,Va G A U {#}, if 6{q,e,a) is defined, then 

S{q, u) is defined => u = (e, 6), for some b G X U {#} 

3. Vq G Q,Vit G (A U {#})^,Va G A U {#}, if 6{q,a,e) is defined, then 

6{q, u) is defined u = (6, e), for some b G X U {#}. 
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The notation q 4-tvi q' means that there is some computation of the automa- 
ton M starting fron state q , reading the input (mi, M 2 ) € (XU{#})* x (XU{#})* 
and ending in state q' . 

The language recognized by is: 

L(M) = {(Ml, M 2 ) e A* X X*, I 3q € F,qo q}. 



We are interested in some restrictions on 2-d.f.a. 
Let us consider the four situations: 

3q G Q,q' G F,pi,p 2 , ui ^ e, si, S 2 G X* such that 



(P1,P2) (ui,e) (si#,S2#) / 

9o — Q — >M Q — Q 
3q G Q,q' G F,pi,p 2 , M 2 ^ e, si, S 2 G X* such that 



( 12 ) 



(Pl,P2) (S1#,S2#) / 

9o — Q — >M Q — Q 
3?!, G Q,q' G F, wi,W 2 ,Pi,Ui ^ e,Si G X* such that 

(u)i,u) 2 #) (Pl,e) / («l.e) / (si#.e) , 

qo — >M qi — >M Qi — >M Qi — >M q 

3^2, 92 & Q^q' & wi,W 2 ,P 2 , U 2 e, S 2 € X* such that 



(13) 



(14) 



(wi#,W2) (e,P 2 ) / U,U2) , (e.S 2 #) / 

qo >M 92 92 92 9 



(15) 



The 2-d.f.a. Ai will be said strictly balanced iff neither of situations (Il2ll3ll4llbt 
is possible. 

The 2-d.f.a. Ai will be said balanced iff neither of situations (11:^1111151) is possible. 



Given a deterministic graph F = (V, E) on an alphabet X we call it complete 
if, for every vertex v and word u G X* there exists a path 7 in T starting from 
V and labelled by the word u. We denote by m © m the unique vertex which is the 
end of this path 7 . 

Definition 2. Let F = (V,E,vi) be a 1-graph which is deterministic , complete 
and such that vi is a root. We call rational structure on F every (\X\-t-2)-tuple 
of finite automata S = {W,Mf^,{Mx)x^x) such that 

1. W is a one-tape deterministic finite automaton on X* such that 
Vm G X* ,3w G L(W), Ml 0 M = Ml 0 re 

2. Mg is a 2-d.f.a. which is strictly balanced , and 

L(Mg) = {(tMi, ZM 2 ) G L(W) X L(W) I Ml © IMi = Ml © m; 2 } 

3. for every letter x G X, is a 2-d.f.a. which is balanced , and 
L{Mx) = {{wi,W2) G L(W) X L(W) I Ml © wix = mi © m; 2 } 



Theorem 11. F = (V,E,vi) be a 1-graph which is deterministic and complete. 
Then F has a rational structure S. 

Given a normalized d.p.d.a. such that F is the computation 1-graph of A4, one 
can compute such a rational structure S. 
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4.3 Decision Problems 

Let us investigate several decision problems over infinite graphs. 

The isomorphism problem for equational graphs: 

INSTANCE: Two equational graphs AjEj- 
QUESTION: Is A isomorphic with I 2 ? 

This problem has been solved by “model-theoretic” methods in [Cou811CouH()a| 
(hence by a method quite different, by nature, from the method used for solv- 
ing the equivalence problem for dpda in [Senf)7clSen()HStif)f)| L Let us mention 
that the subproblem where Ui ,/2 are supposed rooted and deterministic, can 
be solved as a corollary of the dpda’s equivalence problem: it reduces to the 
property that each graph Ti is a homomorphic image of the other and this last 
problem is solved below by a corollary of the dpda’s equivalence problem (the- 
orem rnrii . 

The isomorphism problem for algebraic ordered trees: 

INSTANCE: Two algebraic ordered trees Ti,T 2 - 
QUESTION: Is T\ isomorphic with T 2 ? 



Corollary 12. The isomorphism problem for algebraic ordered trees is decid- 
able. 

This follows from theorem 0 and the reduction given in KJou83L theorem 5.5.3 
p.l58]. 

The bisimulation problem for rooted equational graphs of finite out- 
degree. 

INSTANCE: Two rooted equational graphs of finite out-degree A; -^ 2 . 
QUESTION: Is A bisimilar to A? 



Theorem 13. The bisimulation problem for rooted equational graphs of finite 
out-degree is decidable. 

This kind of problem has been studied for many classes of graphs (or processes), 
see for example !HHm) . frCT] . p??TT] . [KiHlVUl.-fj . Hffl, |HIM,|neSHn|, 
psnnsi, »], iHiinzi, .The case of rooted equational graphs of finite 

out-degree was raised in im?H! ( see Problem 6.2 of this reference ) and is a 
significant sub case of the problem raised in |Sti96j (as the bisimulation-problem 
for processes “ of type -1”). It is solved in fSen98alSen00aj by a method derived 
from the one used to solve the equivalence problem for dpda. 

The isomorphism problem for algebraic unordered trees. 

INSTANCE: Two algebraic unordered trees A,T 2 . 
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QUESTION: Is T\ isomorphic with T 2 ? 

Theorem 14. The isomorphism problem for algebraic unordered trees is decid- 
able. 

This result can be proved by a variant of the solution of the bisimulation problem 
for rooted equational graphs of finite out-degree. 

Key idea: 

Let us consider two algebraic unordered trees Ti, T 2 . Their vertices are labelled 
on a ranked alphabet {F,p). We suppose that these trees have no leaf (one can 
easily reduce trees in such a normal form, by a transformation which preserves 
algebraicity and in such a way that isomorphic trees have isomorphic normal 
forms). Let us consider the two trees T(, T 2 , which are deduced from Ti,T 2 just 
by removing the labels on the vertices but encoding then the label of a vertex by 
labels, (we use the alphabet Y described in (|TT|l L on the edges getting out of this 
vertex. The set of all words labelling a prefix of some branch of T/ (1 < z < 2)is 
just the language Brch{Ti), which is recognized by some dpda Ai. Hence T' is 
the infinite unfolding of the computation 1-graph Ti of Ai- Note that T^ (for 
1 < i < 2), is a rooted deterministic 1-graph over the alphabet Y and it is 
equational, by theorem 0 An equivalence relation zy on T is defined by: 

V/,/' e T,f e [i,p{f)],j e [i,p{f )],{[/ ,i]v[f,j] f = /')• 



Definition 15 Let T T' be two deterministic n-graphs over the alphabet Y . TZ 
is a zy-permutative bisimulation from T to T' iff 

1. dom(7?.) = Vr, im(7?.) = Vr> 

2. Vz e € n, 

3. € TZ, there exists a bijection hy^ from E{v) = {{v,y,w) G Er} onto 
E{v') = {{v' ,y' ,w') G Ep'} such that, for every y €Y,w € Vp-’ 
hy^yi{v,y,w) = {v',y',w') ^ (zyzyzy'). 

This notion of zy-permutative bisimulation is close to the notion of zy-bisimulation 
considered in ( |Sen98al Definition 3.1], jSenOQbl Definition 23]), so that an adap- 
tation of the techniques can be easily done. 

The quotient problem for rooted deterministic equational 1-graphs. 



INSTANCE: Two rooted deterministic equational 1-graphs A,T 2 . 
QUESTION: Does there exist some homomorphism from Ti to T 2 ? 

Theorem 16. The quotient problem for rooted deterministic equational 1-graphs 
is decidable 

This result can be derived from theorem 1 1 1 1 a.bove and a more general result 
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Theorem 17. Let Ti,T 2 be two rooted deterministie 1-graphs. Let us suppose 
that Ti is given by an automatic structure S and that Tj is the computation 
1-graph of a given dpda Ai. One can then decide whether there exists a graph 
homomorphism : Ji — >■ l 2 . 

Sketch of proof: Let S = For every a G X U {e}, let us 

call a-decomposition, any 2n-tuple {ui,u[,U 2 ,U 2 , ■ ■ ■ , Un, u'„) of words such that 
there exists a computation 



90 




— ^ 9i 







(16) 



and 



Ui • U2 • • • Mn G u[ • U2 • • • G X*#. 



Let M =< X,Z,Q,S 2 ,qo,zo,F >. We then define, for every a G X U {e}, the 
languages 

La = {ui 0 u'i<>u 20 u 2 ^ . . . OunOu'„OOq I {ui,u'i,U 2 ,U 2 , . . . , tt„, is some 
a-decomposition and go^o q<^ where q G Q,u> £ Z*}. (17) 



L), = {ui0u^0u20m2^ . . . Ou„Ou'„OOq \ {ui,u[,U 2 ,U 2 , ■ ■ ■ , u„, u(j) is some 
a-decomposition and go^o ^ ^ gw where q G Q,uj G Z*}. (18) 

(Here O is just a new letter not in X U {#})• 

One can check that there exists some homomorphism /i : Ti — ^ T 2 iff 

1. \/u G X*,3w G L(W), such that qgzo u = qozo ©^2 to 

2. for every a G X U {e}, La = L(j. 

Point 1 reduces to test the equality between the two rational languages over the 
alphabet QU Z: 

{gw G QZ* I 3u G X*,qozo — Gm 9 w},|gw G QZ* \ 3w G L(W),goZo — 9 w} 



which can be effectively done. 

Point 2 amounts to test a finite number of dpda equivalences, which can be ef- 
fectively done, by theorem Q1 □ 



5 Thue-Systems 

5.1 Abstract Rewriting Systems 

Let E be some set and — some binary relation over E .We shall call — > 
the direct reduction.We shall use the notations for every integer {i > 0), 
and — in the usual way (see fHueSOj l. By < — >•, we denote the relation 
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— > U i — . The three relations — and < * > are respectively the reduction, 
derivation and the equivalence generated by — >. 

We use the notions of confluent relation and noetherian relation in their usual 
meaning l lHue80l or dMl §4, p.266-269]). 

An element e G if is said to be irreducible{Tesp. reducible) modulo ( — >) iff 
there exists no (resp. some) e' G E such that e — > e' . By Irr( — >■) we denote 
the set of all the elements of E which are irreducible modulo ( — l). 

Given some subset A of E, we use the following notation: 

< A > . = {e G E \ 3a G A, a e}, [A]^. = {e G E \ 3a G A,a e} 



5.2 Semi-Thue Systems 

We call a subset S C X* x X* a semi-Thue system over X. By — >s we 
denote the binary relation defined by: V/, g G X* , 

/ — >s 9 iff there exists {u, v) G S,a, j3 G X* such that / = aufl, g = avfl. 

— ^5 is the one-step reduction generated by S. All the definitions and notation 
defined in the § “abstract rewriting systems” apply to the binary relation — >$■ 
Let us give now additional notions, notation and results which are specific to 
semi-Thue systems. 

We use now the notation Irr(S') for Irr( — >s)- We define the size of S' as 
||S|| = -I- |u|. A system S is said strictly length-reducing (strict, for 

short) iff, V(u, u) G S, |u| > |u|. 

Definition 3. Let us consider the following conditions on the rules of a semi- 
Thue system S: 

Cl : for every {u, v), {u' , v') G S and every r' , s' G X* 

V = r'u's' =>| s' 1=1 r' |= 0 

C2 : for every {u, v), {u' , v') G S and every r, s' G X* 
rv = u' s' =^\ s' 1= 0 or I s' |>| v \ 

C3 : for every {u, v), {u' , v') G S and every r' , s G X* 
vs = r'u' =^\ r' 1= 0 or | r' |>| r | 

S is said special iff'i{u,v) G S, | r |= 0 
S is said monadic iff\/{u,v) G S, | r |< 1 
S is said basic iff it fulfills Cl, C2 and C3 
S is said left-basic iff it fulfills Cl and C2. 

Each condition Cfli G [1,3]) consists in the prohibition of some superposition 
configuration for two redexes {r,u,v),{r' ,u' ,v') of S. 

Condition Cl : Cl expresses the prohibition of the following configuration: 

V = r'u' s' where | it' |<| r | 
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schema 1: 




In other words, a righthand side of rule may not strictly embed any lefthand 
side of rule. 

Condition C2 : C2 expresses the prohibition of the following configuration: 
rv = u' s' where 0 <| s' |<| | 

schema 2: 



r V 




In other words, a righthand side of rule may not be “strictly overlapped on the 
left” by any lefthand side of rule. 

Condition C3 : C3 expresses the prohibition of the following configuration: 
vs = r'u' where 0 <| r' |<| z; |. 

schema 3: 




3 



In other words, a right-hand side of rule may not be “strictly overlapped on the 
right” by any left-hand side of rule. 

These definitions appeared in LNiv7()l(Joc71lhiut73Hak7hl . We refer the reader 
to fl).lhllE()11.3] for more information on rewriting systems and [Sen h4j for links 
with formal language theory. 

5.3 Decision Problems 

It is well-known that the confluence property, for a semi-Thue system S, can 
be decided, provided that the relation — >s is noetherian. Let us consider the 
following “weak confluence” property: 



s 



( 19 ) 
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A system S fulfilling (113 for some irreducible word /, is said confluent over the 
class of f. Such a property deserves some interest because, 

— the language < / can be recognized in linear time, as soon as S is 

strict and finite (this motivation remains valid even for some graph rewriting 
systems, see |AOpS93| 1. 

— in the case where X* / — >s is a group, the confluence property over the class 
of the empty word, e, is sufficient to ensure decidability of the word-problem. 

Let us call Class Confluence Problem (CCP for short), the following decision- 
problem: 

INSTANCE A finite alphabet X, a noetherian semi-Thue system S over X and 
a word / S X*, which is irreducible modulo S. 

QUESTION Is S confiuent on the class [/], 

Several works have been devoted to the decidability/complexity of problems 
similar to CCP (EliHSl, IAhiSs7l . ITmTI . K7m\ . ITH^ . 

IMN091I . lOttfi'ial . lOttfi'ihl . I7H^ . Ennni, IMN 0/931 . lEIgMI , IS,^n9shn . 
In particular it is known that CCP becomes 

— undecidable for strict, finite, semi-Thue systems ( |Qtt92b| l 

— decidable in polynomial time for strict, finite, basic semi-Thue systems 
( IS^n98bl l. 

Corollary 18. The Class Confluence Problem is decidable for strict, finite, left- 
basic semi-Thue systems. 

This corollary follows from theorem Q and the reduction given in |Sen90l the- 
orem 5.17]. In the context of the two above complexities obtained for general 
semi-Thue systems and for basic semi-Thue systems, it is a natural challenge 
to determine the complexity of the CCP for strict, finite, left-basic semi-Thue 
systems. It is not even known, at the moment, whether this problem is primitive 
recursive or not (from this point of view, its status is the same as for the dpda’s 
equivalence problem, since the reduction given in ISen9L)l is in DEXP-time and 
a converse reduction, in DEXP-time too, is given ) . 
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Abstract. This paper presents a survey of fundamental concepts and 
main results in studying the equivalence problem for computer programs. 
We introduce some of the most- used models of computer programs, give 
a brief overview of the attempts to refine the boarder between decidable 
and undecidable cases of the equivalence problem for these models, and 
discuss the techniques for proving the decidability of the equivalence 
problem. 



Informally, the equivalence problem is to find out whether two given programs 
have the same behavior. By picking various formal definitions of the terms “pro- 
gram” and “behavior” we get numerous variants of this problem. The study of 
the equivalence problem for computational models is always of basic interest in 
computer science beginning with the original works on the development of the 
formal concept of computer program \ 2 mUtUmi Ij . Tackling the equivalence 
problem we comprehend to what extent the very nature of computations can be 
conceived by means of formal methods and how much specific changes in the 
structure of a computer program affect its behavior. The understanding of re- 
lationships between the syntactic and semantic components of programs is very 
important for specification, verification and optimization of programs, partial 
computations, reusing of programs, etc. That is why this problem significantly 
influences both the theory and the practice of computer science and software 
engineering. 

The decidability of the equivalence problem essentially depends on the ex- 
pressive power of a computational model and the exact meaning of the term “the 
same behavior” . Two fundamental results of 50th governed the advancement in 
studies of the equivalence problem for computer programs. 

When programs under consideration are deterministic, it is quite reasonable 
to assume that two programs have the same behaviour if they compute the same 
function, i.e. for every valid input they output identical results (if any). A map- 
ping which associates each program tt with the recursive function computed 
by TT is called enumeration of recursive functions. If a programming system PS 
has an effective interpreter that can simulate every program by presenting its 
description as a part of the input, then enumeration is called computable. If, 
moreover, a programming system PS is such that any other programming sys- 
tem PS' corresponding to computable enumeration can be effectively translated 
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into PS then the enumeration specified by PS is called acceptable (see gani). 
In 1953 H.G.Rice proved that if a programming system PS corresponds to 
an acceptable enumeration of recursive functions, then any non-trivial property 
of computer programs which refers only to functions computed by programs 
is undecidable. This implies the undecidability of the functional equivalence of 
computer programs for every natural universal programming system. 

On the other hand, in 1956 A. A. Lyapunov and Y.I.Yanov introduced a 
propositional model of sequential programs (Yanov schemata) and set up the 
equivalence problem for this model. Yanov scheme may be thought of (see ini ') 
as a finite transition system whose nodes are labelled with statements Ai, . . . , An 
and basic propositions pi, ■ ■ ■ ,Pm- Each statement A is associated with a set of 
basic proposition Sh{A). Nodes marked with statements are called transformers; 
each transformer has a single outgoing arc. Nodes marked with propositions are 
called recognizor; each recognizor has two outgoing arcs labelled with T and 
_L. Transformers are understood as abstractions of program statements, whereas 
basic propositions stand for logic conditions. Two specific nodes — entry and 
exit are distinguished. A run of a Yanov scheme tt starts from the entry given 
some initial evaluation (5*^ of basic propositions. When a run passes via a trans- 
former labelled with A, the statement A is executed by changing arbitrary the 
values of propositions from Sh{A). When a run passes via a recognizor labelled 
with basic proposition p, it checks the current value of p and follows the arc 
marked with this value. A run terminates as soon as it reaches the exit. A de- 
terminant of a run r is a sequence of pairs (5°, A^), (J^, A^), . . . , (5"“^, A"), . . ., 
where A^, A^, . . . , A”, . . . is a sequence of statements executed during this run, 
and <5*, i > 1 is the evaluation of basic proposition after executing AL A de- 
terminant det(Tr) of a Yanov scheme tt is said to be a set of determinants of all 
possible terminated runs of tt. Two schemata tti and tt 2 are called equivalent if 
det{TTi) = det{TT 2 ). In 1957 Yanov m found a complete axiomatization (equa- 
tional calculus) for the equivalence relation thus defined and proved that the 
equivalence problem for this computational model is decidable. 

Both results. Rice’s theorem and the decidability of equivalence problem 
for Yanov schemata, were the pioneering steps from the opposite sides towards 
the boarder between decidability and undecidability of the equivalence prob- 
lem for computational models. Since that time the refinement of this boarder 
has become the topic for a large body of research. In succeeding years a wide 
variety of devices defining computations, languages and translations were intro- 
duced |45l.'-iiSft)p,3t!f,'-i5j . Every time when an all-new model of computation made 
its appearance, the equivalence problem for this model challenged researchers. 
Considerable effort devoted to this problem yielded an amazing amount of re- 
sults and techniques that substantially extend the capability of formal methods 
in computer science. 

Now it is hardly possible to cover in a single survey all main results and ap- 
proaches to the equivalence problem for the most important models of computa- 
tion. In this paper we consider in some details the current state of art in studying 
the equivalence problem for two computational models — propositional sequen- 
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tial programs and first-order sequential programs. An overview of the techniques 
for proving the decidability of language equivalence and relational equivalence 
for devices defining languages and translations (relations) can be found in 0. 
Since the publishing of this paper, some new results has been obtained 
that substantially revised our understanding of decidability /undecidability fron- 
tier for the equivalence problem. A broad spectrum of results on the bisimilation 
equivalence problem for automata and process algebras is discussed in much 
details in m- 

1 Propositional Sequential Programs 

In this section we introduce the concept of a propositional sequential program 
(PSP), its syntax, and semantics and discuss the principal results on the equiv- 
alence problem for this model of computations. 



1.1 Syntax of PSP 

Fix two finite alphabets A — {a^, . . . , a^} and P — {pi, . . . ,Pm}- 

The elements of A are called basic actions. Intuitively, the basic actions 
stand for the elementary program statements such as assignment statements 
and procedure calls. A finite sequence of basic actions is called a term. The set 
of all terms is denoted by A*. We write A for the empty term, \h\ for the length 
of a term h, and hg for the concatenation of terms h and g. 

The symbols of P are called basic propositions. We assume that basic propo- 
sitions denote the primitive relations on program data. Each basic proposition 
may be evaluated either by T (falsehood) or by T (true). A binary M-tuple 
<5 = (di, . . . ,d,M) of truth-values of all basic propositions is called a condition. 
We write C for the set of all conditions. 

A propositional sequential program (PsP) over alphabets A, P is a tuple 
7T = (y, entry, exit, loop, B, T), where 

— y is a finite set of program nodes] 

— entry is an initial node^ entry G V, 

— exit is a terminal node, exit G V, 

— loop is a dead node, loop G V, 

— B : y — >■ A* is a binding function, associating every node with some term 
so that i?(entry) = S(exit) = S(loop) = A ; 

— T : (y~ {exit})xC — >■ y is a total transition function such that T(loop, d) = 
loop for every S from C. 

In essence, a PSP may be thought of as a finite-state labelled transition system 
representing the control structure of a sequential program. We extend a binding 
function B to the sequences of program nodes by assuming B{v\,V 2 , ■ . ■ ,Vk) = 
B{vi)B(v 2 ) . ■ . B{vk). By the size |7 t| of a given PSP tt we mean the number \V\ 
of its internal nodes. 
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1.2 Semantics of PSP 

The semantics of PSP is defined by means of dynamic Kripke structures (frames 
and models) (see [El)- 

A dynamic deterministic frame (or simply a frame) over alphabet A is a 
triple T = (S', sq, R), where 

— S is a non-empty set of data states, 

— So is an initial state, sq € S, 

— R : S X A — >■ Sisan updating function. 

R{s, a) is interpreted as a result of the application of an action a to a data state 
s. It is assumed that each basic action a G A transforms deterministicaly one 
state into another; therefore, the dynamic frames under considerations are both 
functional and serial relative to every action a G A. 

An updating function R can be naturally extended to the set A* as follows 

R*{s,X) = s, R*{s,ha) = R{R*{s,h),a). 

We say that a state s" is reachable from a state s' {s' s" in symbols) if 
s" = R*{s',h) for some h G A*. Denote by [hjjr the state s = R*{so,h) which 
is reachable from the initial state by means of A-sequence h. As usual, the 
subscript T will be omitted when the frame is understood. We will deal only 
with data states that are reachable from the initial state. Therefore, we may 
assume, without loss of generality, that every state s G S is reachable from the 
initial state sq, i-e. S = {[h] : h G A*}. 

A frame Rg = {S',s,R') is called a subframe of a frame R = (S,So,R) 
generated by a state s G S if S' — {R*{s, h) : h G A*} and R' is a restriction of 
R to S' . We say that a frame R is 

— semigroup if R can be mapped homomorphically onto every subframe Rg, 

— homogeneous if R is isomorphic to every subframe Rg, 

— ordered if ^ is a partial order on the set of data states S, 

— universal if [h] = [g] implies h = g for every pair h, g of A-sequences. 

Taking the initial state sq = [A] for the unit, one may regard a semigroup 
frame R as a, finitely generated monoid R = (.S',*) such that [h] * [g] = [hg]. 
Clearly, the universal frame U corresponds to the free monoid generated by A. 
Consequently, an ordered semigroup frame is associated with a monoid whose 
unit [A] is irresolvable, i.e. [A] = [gh] implies g = h = X, whereas a semigroup 
corresponding to a homogeneous frame is a left-contracted monoid, i.e. [gh'] = 
[gh"] implies [h'] = [h"]. 

A dynamic deterministic model (or simply a model) over alphabets A,V is a 
pair M = {R, such that 

— R = {S, Sq, R) is a frame over A, 

— ^ : S — >■ C is a valuation function, indicating truth-values of basic propo- 
sitions at every data state. 



The Equivalence Problem for Computational Models 137 



Let 7T = (V, entry, exit, loop, B, T) be some PSP and M = (iF, be a model 
based on the frame T = {S,Sq,R). A finite or infinite sequence of pairs 

r = (1) 

where Vi G V, 6i G C, i > 0, is called a run of tt on M if m meets the following 
requirements: 

1. t;o = entry; 

2. Vi+i = T{vi,Si) for every i, z > 0; 

3. . . • ,-Cj)]) for every i, z > 0; 

4. (1) is a finite sequence iff T{vm, 5m) = exit for some m > 0. 

When a run r ends with a pair {vm, 5m) as its last element, we say that r termi- 
nates, having the data state Sm = [B{vo,vi, . . . ,Vm)] as the result. Otherwise, 
when r is an infinite sequence we say that it loops and has no result. Since all 
PSPs and frames under consideration are deterministic, every program tt has a 
unique run tt{M) on a given model M. We write [7t(M)] for the result of 7t(M), 
assuming [7t(M)] is undefined when the run loops. 



1.3 The Equivalence Problem for PSPs 

Let tt' and tt" be some PSPs, M a model, At be a set of models, and T a frame. 
Then tt' and tt" are called 

— equivalent on M (tt' tt" in symbols) if [7 t'(M)] = [7t"(M)], i.e. either 
both runs r(7r',M) and r{Tr”,M) loop or both of them terminate with the 
same data state as their results, 

— equivalent on M. (tt' tt" in symbols) if tt' holds for every M G M, 

— equivalent on T (tt' tt" in symbols) if tt' and tt" are equivalent on every 
model M = (A, based on T. 

For a given set of models M (a frame A) the equivalence problem w.r.t. M (A) 
is to check for an arbitrary pair tti, 7T2 of PSPs whether tt' tt" {tt' tt") 
holds. 



1.4 Decidable and Undecidable Cases of the Equivalence Problem 
for PSPs 

Since we are interested in the algorithmic aspects of the equivalence problem, it 
is assumed that frames and models we deal with are effectively characterized in 
logic or algebraic terms (say, by means of dynamic logic formulae or semigroup 
identities). It is also clear that the undecidability of the identity problem “[h'] = 
[h"]T' for a frame T implies the undecidability of the equivalence problem for 
PSP on T . Therefore, we direct our attention only to those semantics (frames 
and sets of models) that have rather simple and natural arrangement. 
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Yanov schemata. The computational model of Yanov schemata [124158111; was 
the first attempt to provide a precise mathematical basis for the common activ- 
ities involved in reasoning about computer program. It is also the first case of 
computer program formalization whose equivalence problem was proved to be 
decidable. In the framework of our concept Yanov schemata may be thought of 
as PSPs whose semantics is based on the set of dynamic models characterized 
by PDL axioms 

[ai]pj = pj 

for every basic action and basic proposition pj which is not in the shift Sh(ai). 
In fact, to check the equivalence of Yanov schemata it suffices to consider their 
behaviour only on the universal frame U which is based on the free monoid gen- 
erated by A. This demonstrates close relationships between Yanov schemata and 
finite automata It was revealed in m and opened up fresh opportunities 
for applications of formal methods in computer program analysis. 

Ordered frames. When a set of states S' of a frame T is partially ordered w.r.t. 
Yit-, this means that no actions in A are invertible and, hence, PSP’s compu- 
tations always progress. First results on the equivalence problem for PSPs on 
ordered frames were obtained by Letichevsky. 

Theorem 1 (^7|). Suppose T is a homogeneous ordered frame sueh that the 
identity problem “\g\ = [h\?” is decidable in time ip{n). Then the equivalence 
problem “tti 7T2?” is decidable in time 0{2"‘'ip{n)). 

In many cases the complexity of decision procedures can be improved by 
taking into account some specific properties of the frames. 

Let T = (S,so,R) be a semigroup frame. Considering it as a monoid, we 
write T y. T for the direct product of the monoids. Suppose IT is a finitely 
generated monoid, 17 is a submonoid of W, and w^,w* are two distinguished 
elements in W. Denote by o and e a binary operation on W and the unit of 
W respectively. We say that the quadruple K = (W,U,w~^ ,w*) is a ko-criteria 
system for the semigroup frame if, where ko is some positive integer, if K and 
T meet the following conditions: 

1 . there exists a homomorphism p oi T y T va. \J such that 

[h] = [g] o if{{[g], [/i])) ow* = e 

holds for every pair g,h in A*, 

2. for every element w in the coset U o w* the equation y o w = e has at most 
fco pairwise different solutions y in the coset w'^ o U. 

Theorem 2 (|59|). Suppose an ordered semigroup frame T satisfies the follow- 
ing requirements: 

1. the reaehability problem “[g] [h]?” is decidable in time ti{m), where 

m = max(|g|, \h\); 
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2 . T has a ko-criteria system K = {W, U, w’*', w*) such that the identity problem 
“wi = W2?” in W is decidable in time t2{m), where m = max(|xi;i|, |tU2|)- 

Then the equivalence problem "tti 7 T 2 ?” is decidable in time 0 (n^(<i(n^) + 
^2 (cn^ ) + log n)) , where n = max(| 7 Ti |, | 7 T 2 |) . 

By finding appropriate criterial systems one may construct uniformly polyno- 
mial-time decision procedures for the variety of well-known frames. Consider, 
for example, a partially commutative monoid Ti induced by some independency 
relation / 1 ^ on the set of basic actions. 

Corollary 1 (Ea). The equivalence problem for PSP on the partially commu- 
tative frame Tj is decidable in time 0 {n^logn) 

Some results on the equivalence problem for PSPs on ordered frames which 
are not semigroup were obtained in M- 

Let T be an arbitrary frame. For every pair of states (s', s") denote by 
/(si,S2) the set of pairs of terms (/ii,/i2), such that R*{si,hi) = R*{s2,h2). 
Two pairs (s^,S2) and (s",S2) are said to be similar {{s[,S2) ~ in 

symbols) if I{s'i,S2) = /(s",S2) holds. We denote by (s'^jS^)/ ~ the similarity 
class containing a pair (s'j^, S2). 

Theorem 3 ([hlj). Suppose that an ordered frame T satisfies the following 
requirements: 

1 . the reachability problem “\hi] -<-r [/12I on T is decidable in time 
^>(max(|ft,i|, |ft,2|)); 

2 . the similarity problem “{[h{], [h'2]) {[hf], [h'f]) 1 ” is decidable in time 

3. for any terms hi, /12 the set {([iLi], [H2]) / : [Hihi] = [^2^12]} contains 

at most 0 (max(|/ii|, I/12I)) similarity classes, 

where <T, W and O are monotonic recursive functions. Then the equivalence 
problem tt' 7 t" ? is decidable in time O(( 0 ^(n)^(n^ 0 (n))!f'(n'* 0 (n))n® logn), 
where n = max(| 7 r'|, | 7 t'|). 

Group frames. If is not a partial order on the set of states S' of a semigroup 
frame J-, then the monoid T contains invertible elements. These elements form 
a subgroup in T , and the decidability of the equivalence problem for PSPs on 
T depends essentially on this group. 

Theorem 4 (EUD). Suppose a homogeneous frame T is associated with a 
right- contracted monoid whose identity problem ‘\g\ = [h] ?” is decidable. Then 
the equivalence problem w.r.t. T is decidable iff it is decidable w.r.t. T' , where 
T' is the maximal subgroup of IF. 

A uniform criterion for selecting groups with decidable problem of program 
equivalence was presented in paiinj . 
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Let So, R) be a frame corresponding to some group, oi, 02, . . . , Ofe be a 

finite sequence of actions, and g, h be some terms. Then the sequence of states 
[g], [501], [gai, 02], ... , [<70102 . . . o„] is called a trajectory in T' . If [<70102 . . . o„] = 
h then we say that this trajectory joins g to h. By the distance p{g, h) between 
states g and h we shall mean the length of the shortest trajectory joining g to h. 
A set of states H, H C S, is called complete if for any h, h G H, there exists a 
trajectory joining sq to h, all of whose elements belong to H, i.e. iL is a simply 
connected subset of S containing the initial state Sq- 

Let H be some complete set of states. Consider the relation g <h h on S 
which is true iS g h and each trajectory joining g to h passes through some 
state [gh'] such that [h'] G H. 

A frame T' = (5, sq, R^ is called (j-frame POj if it contains a finite complete 
set H satisfying the following requirement: 

there exists a positive integer N such that, for any states g,h in T as soon 
as g <H h and p{g, h) > N holds, there exists a state / which satisfies 

g <H f <H h. 



Theorem 5 ([ 3 Uj). If T' is a fl-frame then the equivalence problem for PSP 
on T' is decidable. 

It was demonstrated (see |I7E6I30I52| 1 that a lot of well-known groups (finite 
groups, free groups, free commutative groups, direct products of free groups, 
etc.) fall into the category of / 3 -groups and have a decidable problem of PSP 
equivalence. But in all these cases the complexity of the decision procedures is 
at least exponential of the size of programs. 

On the other hand, Yu. Gurevich noticed that any Turing machine can be 
simulated by a PSP running on a non-trivial Abelian group. This provides a 
grounding for the following theorem. 

Theorem 6 ([ 26 j). Suppose that a frame T is Abelian group and rank{T) > 2. 
Then the equivalence problem for PSPs on T is undecidable. 

Letichevsky m showed that the equivalence problem for PSPs is decidable 
when T is Abelian group of rank 1 . 

Multitape automata. Combining some PSP’s semantics that are easy for solving 
the equivalence problem one may get computational models which are extremely 
complicated for analysis. Thus, for example, by joining together a frame Tc. 
based on a free commutative monoid and a valuation function ^ specified by 
Yanov shifts we get a model which has exactly the same computational power as 
deterministic multitape finite automata. In the framework of PSPs the semantics 
of multitape automata is specified by the following PDL axioms 

Qi, a j G A. 

ai G A, Pj G Sh{ai) 



Pj = [ai]P], 



(2) 
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where a shift Sh : A —^2^ meets a requirement 

Qi Qj => Sh{ai) n Sh{aj) = 0. 

The notion of multitape finite automata was introduced by Rabin and Scott 
in their classical paper of 1959. The equivalence problem for deterministic 
multitape finite automata is one of the most famous problem in automata the- 
ory. It remained open for more than 30 years. The difficulty of this problem is 
manifested in the following facts. In HSj it was noticed that the inclusion prob- 
lem for deterministic multitape finite automata is undecidable. In 1968 Griffiths 
m proved the undecidability of the equivalence problem for nondeterministic 
multitape automata. On the other hand, by replacing axioms (j2|) by their weak 
variants 

Pj [ai]pj, Oj e A, Pj G Sh{ai), 

one get a semantics composed of monotonic commutative models. In 1421 it was 
demonstrated that the equivalence problem for PSPs on monotonic commutative 
models is decidable. 

In |4|5tij positive solutions to the two-tape case of the equivalence problem 
were given. It was demonstrated also (see [,'11 12;il7] l that the equivalence problem 
is decidable on some specific classes of deterministic multitape finite automata. 
In 1991 Harju and Karhumaki reduced the equivalence problem for determin- 
istic multitape automata to the multiplicity equivalence problem for one-tape 
automata, attacked the latter by generalizing Eilenberg’ Equality Theorem m, 
and finally proved the following theorem. 

Theorem 7 (Jl3]). The equivalence problem for the n-tape deterministic finite 
automata is decidable. 

To the extent of our knowledge there are no results so far on the complexity 
of the equivalence problem for multitape deterministic finite automata. 

Reset actions. Suppose that A contains some distinguished actions bi, . . . ,bk 
which reset any PSP’s run to some prescribed states. These actions corresponds 
to the right zeros in monoids; their semantics is specified by the axioms 

The equivalence problem for semigroup frames with right zeros was studied in 
I tij . In pH] it was proved that the equivalence problem for PSPs with reset 
actions is decidable in time which is double-exponential of the size of programs 
to be analyzed. In m it was proved that this problem is decidable in time 
which is polynomial of the size of programs and exponential of the maximal 
number of occurrences of each reset action in programs. It was demonstrated also 
that a variety of problems in polynomial-time hierarchy (3-SAT, the emptiness 
problem for the intersections of n deterministic n-state finite automata) are 
interreducible with various cases of the equivalence problem for PSPs supplied 
with reset statements. 



142 



Vladimir A. Zakharov 



On the other hand, by providing deterministic two-tape finite automata with 
a single reset action we get one of the most simple deterministic model of com- 
putations whose equivalence problem is undecidable m- 

2 First-Order Sequential Programs 

In this section we consider the concept of first-order sequential program (FOSP), 
its sintax, and semantics, and discuss the principal results on the equivalence 
problem for this model of computations. 

2.1 Syntax of FOSP 

Terms and substitutions. Fix three alphabets X, F, R of variables, function sym- 
bols, and relation symbols, respectively. Without loss of generality we assume 
that the set of variables X = {a;i, . . . , xn} is finite. 

The set of terms T ermx over X and F is defined in the usual way. When a 
variable Xi occurs in a term t, we denote this fact by writing Xi G t. A substi- 
tution on X is a mapping 9 from X to Termx- We write 0 as a set of bindings 
{cci/ti, . . . , XAr/tAr}, assuming, in contrast to P, that U = Xi is possible. The 
term ti which binds a variable Xi in 9 is denoted by 9[i\. A set of variables 
{xi : Xi ^ 9[i]} is called a domain Dom{9) of a substitution 9. A set of variables 
{xj : 3k Xj G 9[j]} is called a range of 9. We use e to denote empty substitution 
{aJi/cci, . . . ,xx/xx}, and write r]9 for the composition of substitutions rj and 9. 
The application of a substitution 9 to an expression E (term, atom, formula) is 
defined as in P and denoted by E9. The set of all substitutions on X is denoted 
by Subst^. 

As far as sequential programs are concerned, a binding Xi jti may be thought 
of as an assignment Xi := ti, while a substitution 9 stands for a parallel 
composition of assignments. 

Atoms and conditions. Formulae of the form p{xi.^, . . . ,Xi^,), where p is a fc-ary 
relation symbol and , . . . , Xi^, are variables in X, are called atoms. A literal 
based on an atom A is either A itself or its negation -•A. Consider some finite set 
of atoms A = {Ai, . . . , Am}. Then a M-tuple C = (Li, . . . , Lm) of literals based 
on the atoms Ai , . . . , Am is called a condition. We write for the set of all 
possible conditions based on the set A of atoms. Clearly, \Ca\ = 2^. Conditions 
stand for the conjunctions of basic tests used in conditional statements if-then- 
else and loop statements while-do. 

Term interpretations. A term interpretation I for X, F, R is defined as follows 
(see P). 

— the domain of / is the term universe Termx', 

— each fc-ary function symbol / in F is assigned the mapping from {Termx)^ to 
Termx which maps a sequence t\, . . . ,tk of terms to the term f{t \, . . . , tk)', 

— each /c-ary relation symbol p in i? is assigned a set p^ of fc-tuples of terms. 
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Given a term interpretation I, an atom A = and a substitu- 

tion 9 in Subst^, we say that I satisfies the atom A {the literal ^A) for 9 iff 
. . . ,Xi^9) G (resp. {xi^9, ...,Xi^9) ^ p^) holds. 



First-order sequential programs. Consider alphabets X, F, R, and a finite set 
of atoms A. Then a sequential program over X, ^ is a finite-state labelled 
transition system represented by a tuple tt = {V, entry, exit, loop, S, T), where 

— y is a finite set of program nodes; 

— entry, exit, loop are some distinguished program nodes; 

— S' : y — >■ Substx is a total assignment function associating every program 
node with some substitution on X; 

— T : (y — {exit}) x — >• (y — {entry}) is a total transition function such 

that T(loop, C) = loop holds for every condition C in 

The nodes entry, exit, loop are called initial, terminal, and dead nodes, re- 
spectively. All other nodes in V are called internal nodes. By the size | 7 t| of a 
given program tt we mean the total number of symbols involved in tt. Given a 
finite sequence of program nodes v\, . . . , v„-i,Vn we write S{vi, . . . , v„) for the 
composition S(u„)S(u„_i) . . . S(ui) of substitutions associated with the nodes. 



Example 1. Consider a sequential program tt written in traditional style. 

i:=l; M:=0; 
while i <= n 

do if a[i] > M then M:=a[i] ; i:=i+l od. 



This program is intended for selecting the maximal positive element M in the 
integer array a[l:n] . According to the definition above it is represented by the 
following transition system over the set of variables X = {z, M}, the set of 
functions F = {a^^^}, and the set of atoms A = {Ai,A 2 }, where Ai = (i < n) 
and A 2 = (a(z) > M) 



loop 





entry 


C2 






C3 


C4 






Vd G4 _ 






Cl 


\ \C2 C2 

V^\0^ 


y 


Cl 



exit 



0i = {z/l,M/O}, 

92 = {ili+l,M/M}, 

03 = {i/i + 1, M/a{i)}, 

04 = e, 

Cl = (-'Ai, -1A2), 

C2 = (-'^1,^2), 

C's = (Ai,-'A2), 
c *4 = (^1,^2), 



Fig. 1 
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2.2 Semantics of FOSP 

Consider a program tt = (V, entry, exit, loop, S', T). A sequence of pairs 

w = {vo, Co), (ui. Cl), . . . , {vi, Ci), (3) 

where vq,vi, . . . ,Vi, . . . are program nodes and Cq,Ci, . . . ,Ci, . . . are conditions, 
is said to be a path in tt if uq = entry and = T{vi,Ci) holds for every 
i, z > 0. A path w is called total if either w is infinite or it is finite and has a 
pair (exit, C) as its last element. 

Given a term interpretation I, we say that a total path is a run of tt on 
I if for every pair (vi,Ci) the interpretation / satisfies each literal L in Ci for 
the substitution S{vq,Vi, . . . ,Vi). The pairs (vi,Ci), i > 0, are said to be the 
states of the run. Whenever a state {v, C) occurs in (0) then we say that the run 
w reaches this state. Since the programs under consideration are deterministic, 
every program tt has a unique run on a given interpretation I. Denote this run by 
7 t(/). If 7 t(/) reaches the final state {vm,Cm) such that Vm = exit, then we say 
that the run terminates having the substitution S(yo,Vi, . . . ,Vm) as the result. 
Otherwise, when tt{I) is an infinite sequence, we say that it loops and has no 
result. We write r(7r, I) for the result of 7r(J), assuming r(7r, I) is undefined when 
the run loops. 

Programs tt' and tt" are called equivalent [tt' ^ tt" in symbols) if rfir' ,1) = 
r(Tr" ,1) for every term interpretation I. The equivalence problem for sequential 
programs w.r.t. term interpretations is to check for an arbitrary pair tti, tti of 
programs whether tt' ~ tt" holds. 

Along with the equivalence relation some other properties of FOSP are im- 
portant for computer program analysis. A program tt is called empty (total) if 
r(7T,/) is undefined (respectively, defined) for every interpretation I. A program 
TT is called free if every total path (0 in tt is a run tt(I) of tt on some interpretation 

/. 

2.3 Decidable and Undecidable Cases of the Equivalence Problem 
for FOSPs 

Undecidability results. An extensive study of the equivalence problem for a first- 
order model of sequential programs began since 1968, when M. Paterson isni 
introduced a concept of formalized computer program and set up decision prob- 
lems for this model of computations. The definition of FOSP above is but a 
modification of this concept expressed in terms of transition systems and substi- 
tutions. In 1970 D.C.Luckham, D.M.Park and M.S. Paterson [22| and Letichevsky 
m showed that multihead one-tape finite automata can be simulated by means 
of FOSP. Since the emptiness problem for multihead finite automata is undecid- 
able, we arrive then at the following 

Theorem 8 ( [32 27J b The equivalence, emptiness, totality and freedom prob- 
lems for FOSP are undecidable. 
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A serious effort was mounted to reveal to what extent program structure, al- 
phabets and interpretations affect the undecidability of the main decision prob- 
lems for FOSPs. 

In |1 8IS6I,S7I41 1 it was shown that the boarder between decidability and un- 
decidability for FOSPs is very subtle. 

When |A| = 1 and all functions and predicates are unary we have FOSPs 
with inseparable memory. Programs of this kind corresponds to Yanov schemata. 
When |A| = n, all predicates are unary, and all substitutions involved in pro- 
grams are ones of the form {xi/ f{xi)}, we obtain deterministic n-tape finite 
automata. The main decision problems for both classes of FOSP are, clearly, 
solvable. But as soon as some minor modifications are made in these computa- 
tional models, there comes a point of undecidability. 



Theorem 9 (I36j). Let X = {x\,X 2 }, F = {/^^^}, P = Suppose also 

that only substitutions of the form {xi/ f{xi)}, {X 2 / f(x 2 )} , and {X 2 /X 1 } are 
available in programs. Then the emptiness problem for FOSPs over (X, F, P) is 
undecidable. 



The alphabet Ai = {X, F, P) used in the theorem above is the minimal al- 
phabet which guarantees the undecidability of the emptiness problem for FOSPs, 
since any reduction of Ai leads to the well-known decidable cases of FOSPs. It 
should be noticed also that FOSPs over Ai may be thought of as deterministic 
two-tape finite automata supplied with an extra action for inserting a content 
of one tape into another. 

Theorem 10 ([37]). Let X = {xuX 2 }, F = {c(o),/W,sW}, P = {pW}. 
Suppose also that only substitutions of the form {x\/ f{xi)} , {x 2 /g{x 2 )} and 
{x\/ c,X 2 / c\ are available in programs. Then the totality problem for FOSPs 
over {X, F, P) is decidable, whereas the emptiness and the equivalence problems 
are not. 



In this theorem we deal with a class of FOSPs located on the very boarder 
between decidable and undecidable. Programs from this class correspond to de- 
terministic two-tape finite automata supplied with a reset action. 

Paterson’s and Letichevsky’s results W127\ were greatly strengthened by 
Itkin and Zwinogrodski m They showed that the equivalence problem for 
FOSPs is undecidable for any “reasonable” equivalence definition based on the 
set of all interpretations and applicable to any FOSP. It is assumed that an equiv- 
alence of FOSPs is “reasonable” if it meets the following requirements specified 
below informally. 

1. An equivalence relation is said to consistent if non-equivalence of two pro- 
grams 7Ti and 7T2 implies the existence of interpretation / for which runs 
7Ti(/) and 7T2(/) differ. 

2. An equivalence relation is said to be nonsingular if the existence of statement 
which, in some interpretation I, would be essential for the result of the run 
7Ti (/) and non-essential for the result of the run 7T2(/) implies non-equivalence 
of 7Ti and 7T2. 
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Theorem 11 ([2l]). If the equivalence relation ~ on FOSPs is consistent and 
non-singular then the equivalence problem “tti ~ 7T2?” is undecidable. 

In the theory of first-order sequential programs this theorem plays exactly 
the same role as Rice’s theorem for universal computer models. 



Decidability results. The theorem above implies that decidable cases of the equiv- 
alence problem for FOSPs could be obtained either by turning from functional 
equivalence to some crude non-interpretive equivalences which are more readily 
solved, or by imposing additional constraints on the structure of programs and 
selecting thus some wide classes of programs with solvable problem of functional 
equivalence. 

The first line of investigations has led to a concept of logic-term equivalence 
of FOSPs |E2] which is as follows. For every path (EJ going in a program tt = 
{V, entry, exit, loop, S, T) from entry to exit a sequence 

CoS{vo),CiS{vo,vi), . . . ,C'jS'(uo,ui, . . . ,Uj), . . . 

of conditions instantiated by compositions of substitutions S{vq, ui, . . . , Uj), i> 
0, assigned to program nodes Vq,Vi, . . . ,Vi passed along this path, is referred to 
as a determinant of a path O- Denote by det^ir) the set of determinants of all 
terminated paths in a program tt. Two programs tti and 7T2 are called logic-term 
equivalent if det{'Ki) = det{'K 2 ). 

It is apparent that det^ir) is a pure syntactical characteristic of a program tt 
since it does not refer to any interpretations. It is easy to show also that det{ni) = 
det{n 2 ) implies tti ~ 7T2. Therefore, the logic-term equivalence is a reasonable 
approximation to the interpretative equivalence. In m it was demonstrated that 
the logic-term equivalence can be checked effectively. 

Theorem 12 (|22p. The logic-term equivalence problem for FOSPs is decid- 
able. 



A timed complexity of a decision procedure for the logic-term equivalence pre- 
sented in P21 is double-exponential of the size of programs. Thereafter 
the complexity of the logic-term equivalence checking procedures were improved 
substantially and reduced to OfnJ). Due to the high efficiency of these algo- 
rithms, they are commonly used for data flow analysis and optimization of com- 
puter programs. 

The alternative line of investigations provides the study of the equivalence 
problem for various classes of FOSPs. Results obtained in [27l,S2p,'I6l,S7] show that 
the undecidability of the equivalence problem is induced by some specific features 
of substitutions involved in FOSP. By constraining the variety of substitutions 
used in programs we arrive at classes of FOSPs with decidable equivalence prob- 
lem. 

The first example of non-trivial class of programs with solvable equivalence 
problem was presented in j,'I2l41)j . A FOSP 7T = (P, entry, exit, loop, S', T) is 
called progressive if all substitutions 9 involved in tt are of the form {x/t}, 
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and for every pair of adjacent nodes u,v such that S{u,C) = v, an inclusion 
Dom{T{u)) C Rang{T{v)) holds. 

Theorem 13 ( [32 :4DJ ). The equivalence problem for progressive programs is 
decidable. 

A FOSP 7T is called conservative if Xt € 9[i] holds for every variable Xt, Xi € 
X, and for every substitution 9 involved in tt. Conservative programs were in- 
troduced in m- Further investigations have shown that the decidability of the 
equivalence problem for FOSPs is closely related to conservative property. 

In it was demonstrated that the equivalence problem is decidable for 
conservative FOSPs whose atoms are of the same arity N, where N = |A|. In 
fact, this class of FOSPs corresponds to the computational model of PSPs whose 
semantics is based on partially commutative frames Ti (see 1431 1 . 

In 1191 it was proved that the equivalence problem is decidable for conserva- 
tive FOSPs whose atoms are of the form p{xi) and all bindings {x\/t} for the 
variable X\ are such that X\ G t. 

Since all decidable cases above deal with free FOSP, this has led M. Paterson 
to a belief that the equivalence problem should be decidable for free programs. 
Paterson’s conjecture is not ruled out so far and gained new testimonies in its 
favor. 

In |Sn| Sabelfeld introduced a class of weakly conservative programs which ex- 
tends the class of conservative programs. A FOSP tt is called weakly conservative 
if Dom(9) C Rang(9) for every substitution 9 involved in tt. 

Theorem 14 (isni)- The equivalence problem for free weakly conservative pro- 
grams is decidable. 

This theorem generalizes the results obtained in izHcn). The only difficulty in 
its application is that the freedom problem for conservative programs is generally 
undecidable. 

In 03 a class of orthogonal programs was introduced. We say that terms t 
and s match if one of the terms occurs in the other. Otherwise, a pair of terms 
t, s is said to be orthogonal. A term with no variables is called ground. We also 
say that a substitution 9 is orthogonal if it is not a renaming, and each pair of 
terms 9[i],9[j], I <i < j < N,is orthogonal. 

Example 2. A substitution 9 = {g{x 2 ,a)/xi,X 3 /x 2 ,g{f{xi,a),X 2 )/x 3 } is or- 
thogonal and a term /(xi, a) matches 9. 

Given alphabets (X, F, R), X = {xi, X 2 , . . . , Xat}, and a finite set of atoms A 
we say that a program tt = {V, entry, exit, loop, S, T) over X, F, A is orthogonal 
if TT meets the following requirements: 

— every internal node v is associated with some orthogonal substitution S(y) 

whose terms . . . , are non-ground; 

— all atoms in A are ones of the form p(xi, . . . , Xtv), i-e. have all possible 

variables in X as their arguments. 
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An example of orthogonal program is depicted in Fig.l. 

The class of orthogonal FOSP is in a sense dual to the class of progressive 
programs E2E3- It is particularly remarkable that the orthogonal programs are 
free, though some of them are not weakly conservative. 

Theorem 15 ( [60J ). The equivalence problem for orthogonal programs is de- 
cidable. 

To the extent of our knowledge the classes of weakly conservative programs 
and orthogonal programs are the largest classes of FOSP with decidable equiv- 
alence problem. 



3 Techniques for Proving Decidability/Undecidability 
of the Equivalence Problem 

The undecidability results are usually obtained by applying common reduction 
technique to the well-known unsol vable problems (halting problem for Turing 
machines I2H!, emptiness problem for multihead finite automata m Post cor- 
respondence problem ED)- 

A common way to prove the decidability of the equivalence problem for de- 
terministic computational models is to find out a method for demonstrating the 
equivalence of two programs. This gives a semiprocedure for equivalence; the 
semiprocedure for nonequivalence trivially exists. The most-used techniques for 
proving the equivalence of sequential programs are ’’path” method, synchroniza- 
tion method, and reduction to a normal form. 

Initially, the “path” method was introduced for proving lower bounds for 
computations j^j. This method takes an advantage of the specific phenomenon 
which is the very nature of some simple models of computation. It is as follows. 
Suppose that a program tt (automaton) for some input data d has a run r(7r, d). 
If a length |r(7r, d)| exceeds some bound L(tt) which depends on program tt only, 
then r(7T, d) can be broken down into fragments r(7r, d) = rir^r^ so that joining 
the initial and the final fragments r\ and r^ we get a run of r(7r, d') = rira of 
7T for some other input data d' . This effect has much in common with Pumping 
Lemma which is widely used for proving non-regularity of formal languages . It 
can be also applied for proving the equivalence of programs. Suppose that, given 
a pair of programs tti, 7T2, the results of some “long” runs of tti and 7T2 reveal 
the difference between the functions realized by these programs. Then some 
“short” runs distinguish tti and 7T2 as well. The equivalence problem is effectively 
solvable when the boundary between “long” and “short” computations depends 
recursively on some syntactic characteristics of tti and 7T2 . In this case one needs 
only to check the behaviour of programs on the finitely many runs to decide if 
they are equivalent or not. Usually the “path” method for equivalence checking 
is combined with the synchronization technique which reduces the equivalence 
problem to the emptiness problem for some devices that simulate synchronous 
runs of programs. 
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This approach was successfully employed in 
main disadvantage of the ’’path” method is of its little use for practice since 
the number of “short” runs to be checked is very large (as a rule, it is exponen- 
tial of the size of programs tti and 7T2 under consideration) . In I43l59l60till this 
deficiency has been overcome by encoding in algebraic terms the “difference” 
between intermediate states of computations r(TTi,d) and r(TT 2 ,d). When a “dif- 
ference” becomes too large, this indicates that results of r{ni,d) and r{n 2 , d) also 
differ. By choosing an appropriate encoding one could reduce the equivalence- 
checking problem “tti 7T2?” to the well-known identity problem “wi = UI 2 ?” 
in some specific monoids W associated with A4, and to the searching problem 
in a state space of polynomial size. 

It is remarkable that whenever the “path” method gives a solution to the 
equivalence problem, then it always provides a solution to the inclusion prob- 
lem, which is to find out whether tti and 7T2 compute the same results whenever 
7Ti terminates. Since the inclusion problem for deterministic two-tape finite au- 
tomata is undecidable, the “path” method is inapplicable to the equivalence 
problem for deterministic multitape finite automata. 

The synchronization technique makes it possible to reduce the equivalence 
checking to the solution of the emptiness problem by constructing for given 
programs tti and 7T2 an appropriate device tti x 7T2. It simulates synchronously 
computations of these programs so that a language accepted by tti x tt 2 is empty 
iff 7Ti ~ 7T2. The main difficulty in the using of this method is to provide the 
decidability of the emptiness problem for the crossproduct tti x 7T2 . Nevertheless, 
in many cases (see l2l53l55lb6teYlb0l5M l this approach does work. A synchro- 
nization technique is discussed in much details in 0. 

The key idea of a reduction to a normal form is in developing a complete 
equational calculus for an equivalence relation on programs. Usually, the com- 
pleteness is proved by demonstrating that each pair of equivalent programs tti 
and 7T2 can be transformed to the common normal form ttq. This approach has 
been used advantageously in l17ti'ilTil49l,^0I^ . 
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Abstract. P systems have been recently introduced by Gh. Paun as a 
new model for molecular computations based on the notion of membrane 
structure. In this work, we consider Rewriting P Systems which use elec- 
trical charges to move objects between the membranes. We show that 
electrical charges, a feature introduced in order to obtain more “realis- 
tic” systems, induces simple systems: we define two normal forms for two 
variants of Rewriting P Systems which make use of electrical charges. 



1 Introduction 

The P systems were recently introduced in PJ as a class of distributed parallel 
computing devices of a biochemical type. 

The basic model consists of a membrane structure composed by several cell- 
membranes, hierarchically embedded in a main membrane called the skin mem- 
brane. The membranes delimit regions and can contain objects. The objects 
evolve according to given evolution rules associated with the regions. A rule 
can modify the objects and send them outside the membrane or to an inner 
membrane. Moreover, the membranes can be dissolved. When a membrane is 
dissolved, all the objects in this membrane remain free in the membrane placed 
immediately outside, while the evolution rules of the dissolved membrane are 
lost. The skin membrane is never dissolved. 

The evolution rules are applied in a maximally parallel manner: at each 
step, all the objects which can evolve should evolve. A computation device is 
obtained: we start from an initial configuration and we let the system evolve. A 
computation halts when no further rule can be applied. The objects in a specified 
output membrane are the result of the computation. 

Many variants are considered in Q, Q, [3, m and |S|. 

We consider here the systems introduced in as Rewriting Super-Cell sys- 
tems (or RP systems), in which the objects can be described by finite strings 
over a given finite alphabet, instead of objects of an atomic type (i.e. elements 
of a finite alphabet). The evolution of an object will correspond to a transforma- 
tion of the string: the evolution rules are given as context-free rewriting rules. 
In a variant proposed in jS] the evolution rules do not specify the label of the 

* This work has been supported by the Italian Ministry of University (MURST), 
under project “Unconventional Computational Models: Syntactic and Combinatorial 
Methods” . 
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membrane where the objects are sent. Instead, “electrical charges” are used, as- 
sociated to membranes and to objects: they can be marked with “positive” (-I-), 
“negative” (— ) or “neutral”. An object marked with -|- (respectively — ) will 
enter a membrane marked with — (respectively -I-), nondeterministically chosen 
from the set of membranes immediately inside to the membrane delimiting the 
region where the object is produced. The neutral objects are not introduced into 
an inner membrane. 

As stated in this feature is useful to obtain more “realistic” systems (with 
biochemical features instead than artificial ones). We show that this feature 
induces simple systems: we give two normal forms for RP systems using this 
feature. First, we will show that every language generated by a RP system with 
priority and without dissolving action can be generated by an equivalent system 
with two rules in each membrane and with a simple structure (the skin membrane 
contains elementary cells only). The second normal form is about RP systems 
without priority: we will show that every systems of that type (under certain 
restrictions) is equivalent to a system of the same type with three rules in each 
membrane. 

A normal form for P-system can be found in m- 

2 Rewriting P Systems 

A membrane structure is a construct consisting of several membranes placed in a 
unique membrane; this unique membrane is called a skin membrane. We identify 
a membrane structure with a string of correctly matching parentheses, placed 
in a unique pair of matching parentheses; each pair of matching parentheses 
corresponds to a membrane. 

A membrane identifies a region, delimited by it and by the membranes imme- 
diately inside it (the membranes are said to be adjacent to the delimited region). 
If we place multisets of objects in the regions from a specified finite set V , we 
get a super-cell. 

A super-cell system (or P system) is a super-cell provided with evolution 
rules for its objects and with a designated output membrane. The depth of a P 
system is equal to the height of the tree describing its membrane structure. 

We consider here Rewriting Super- Cell Systems (RP Systems) with polarized 
membranes. In such systems, objects can be described by finite strings over a 
given finite alphabet. The evolution of an object will correspond to a transforma- 
tion of the string. Consequently, the evolution rules are given as rewriting rules. 
We only use context-free rewriting rules. The rules in a region can mark the 
objects to let them pass through the membranes as described in the following. 

Such a system of degree n, n > 1, is a construct 

n = {V, n, Ml, . . . , Mn,{Ri, Pi), . . . ,{Rn, Pn),io), where: 

— P is an alphabet 

— pis a, membrane structure consisting of n membranes (labeled with 1, ..., n); 

each membrane is marked with one of the symbols -|-, — , 0. 



Two Normal Forms for Rewriting P Systems 



155 



— Mi, 1 < i < n are finite languages over V. 

— Ri,l < i < n are finite sets of context free evolution rules. They are of the 

form X v{p) where X is a symbol of V and v = v or v = v 6. v is a string 

over y, 5 is a special symbol not in V and p G {here, out, +, — }. Often, the 

indication ’’here” is omitted. Note that the substring (p) is not inserted into 
the sentential form as subword. It is only used to communicate the objects 
through the membranes. 

“ Pi, 1 < i < n. are partial order relations over Ri 

— io is the output membrane 

The membrane structure p and the finite languages constitute the 

initial configuration of the system. We can pass from a configuration to another 
one by using the evolution rules in parallel on all strings which can be rewritten, 
obeying the priority relations. Note that each string is processed by one rule 
only; the parallelism refers to processing simultaneously all available string by 
all applicable rules. If several rules can be applied to a string, then we take only 
one rule and only one possibility to apply it and consider the obtained string as 
the next state of the object described by the string. 

A rule can mark the object with +, — , out, here. If a rule marks a string 
with here (or if the mark is omitted), it means that the string obtained after the 
rule is applied will remain in the same region where the rule is applied. If the 
mark is out, the string will be sent to the region placed immediately outside. If 
the string is marked with + (or — ), it will be sent through an inner membrane 
marked with — (respectively +) and adjacent to the region where the rule is 
applied. If no such a membrane exists, then the rule cannot be applied. 

If a rule contains the special symbol 5 then the membrane where the rule is 
applied is dissolved and it is no longer recreated; the objects in the membrane 
become objects of the membrane placed immediately outside, while the rules of 
the dissolved membrane are removed. 

A sequence of transitions between configurations of a P system II, is called 
a computation with respect to II. A computation halts when there is no rule 
applicable to the objects present in the last configuration. The strings collected 
in the output membrane constitute the output of the P system. If a computation 
never stops, then it provides no output. 

It is important to underline the fact that the evolution of strings is not 
independent, but interrelated in two ways: 

1. If we have priorities, a rule ri applicable to a string x can forbid the use of 
another rule, T 2 , for rewriting another string y which is present at the same 
time in the same membrane. Then, we can apply r 2 on y only if r\ is not 
applicable to it and to the string x obtained from x by using ri. 

2. Even without priority, if a string can be rewritten forever then all strings 
are lost, because the computation never stops. 

We denote by L{II) the language generated by II and by RP^{Pri, S) the family 
of languages generated by rewriting P systems using electrical charges, priority 
and the action indicated by S. When one of the feature a G {Pri, 5} is not used. 
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we write na instead of a. If only systems with at most k membranes are used, 
then we add the subscript k to RP. 

More details and examples about P systems can be found in g|, Q and p. 
For elements of Formal Language Theory, we refer to El. 



3 Rewriting P Systems with Priority 
and Polarized Membranes 

In this section we show that every RE language can be generated by a RP 
system (with priority and without dissolving action) with exactly two rules in 
each region and of depth two. 

In the following proof we use the notion of matrix grammar. Such a grammar 
is a construct G = (N,T,S,M,C), where N, T are disjoint alphabets, S £ N, 
M is a finite set of sequences of the form {Ai — >• x\,...,An — t x„), n > 1, of 
context-free rules over NAT (with Ai £ N,Xi £ (N UT)*, in all cases), and C 
is a set of occurrences of rules in M {N is the nonterminal alphabet, T is the 
terminal alphabet, S is the axiom, while each sequence in M is called matrix). For 
w,z £ (NUT)* we write re => z if there is a matrix {Ai — >• X \, ..., A^ -£ x„) in M 
and the strings Wi £ (NUT)*, 1 < i < n+1, such that w = Wi, z = Wn+i, and, for 
all 1 < f < n, either Wi = WiAiWi , Wi+i = Wi XiWi , for some Wi,Wi £ (N U T)*, 
or Wi = Wi+i,Ai does not appear in Wi, and the rule Ai — >■ Xi appears in C 
(the rules of a matrix are applied in order, possibly skipping the rules in C if 
they cannot be applied; we say that these rules are applied in the appearance 
checking mode.) If C = 0 then the grammar is said to be without appearance 
checking (and C is no longer mentioned). We denote by =>* the reflexive and 
transitive closure of the relation The language generated by G is defined by 
L(G) = {ic G T*\S =>* w}. The family of languages of this form is denoted by 
MATac- When we use only grammars without appearance checking, then the 
obtained family is denoted by MAT. 

A matrix grammar G = (N, T, S, M, G) is said to be in the binary normal 
form if N = Ni U N 2 U {5, f}, with these three sets mutually disjoint, and the 
matrices in M are of one of the following forms: 

1. (S -)■ XA), with X £ Ni, A £ N 2 , 

2. (X -)■ Y,A-)-x), with X,Y £Ni, A £ N 2 ,x £ {N 2 UT)*, 

3. (X Y,A-^f), with X,Y £Ni, A £ N 2 , 

4. (X — >■ A, A — >■ x), with X £ Ni,A £ N 2 , and x £ T* . 

Moreover, there is only one matrix of type 1 and G consists exactly of all 
rules A — f appearing in matrices of type 3. One sees that f is a trap-symbol; 
once introduced, it is never removed. A matrix of type 4 is used only once, at 
the last step of a derivation (clearly, matrices of forms 2 and 3 cannot be used at 
the last step of a derivation). According to Lemma 1.3.7 in for each matrix 
grammar there is an equivalent matrix grammar in the binary normal form. 

We denote by CF and RE the families of context-free and recursively enu- 
merable languages respectively. It is known that CF C MAT C MATac = RE. 
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Further details about Matrix grammars can be found in |2| and in El. Moreover, 
in PI it is shown that the one-letter languages in MAT are regular. 

Definition 1. A RP system is in double-2- normal-form if it is of depth 2 and 
in eaeh membrane we have exactly 2 rewriting rules. 

Theorem 1. Every RE language can be generated by a RP system (with electri- 
cal charges, with priority and without dissolving action) in double-2-normal-form. 

Proof. Consider a matrix grammar with appearance checking G = {N, T, S, P, F) 
in the normal form previously described. We assume the matrices labeled in a 
one-to-one manner. With mi, . . . ,mk^ we label the matrices of type 2, with 
TOfcj+i, . . . , mk2 we label the matrices of type 3 and with ■ ■ ■ , Wfc we label 

the matrices of type 4. Moreover, we label the symbols in N with B\, . . . ,Bh. 

We show how to construct a P system of depth 2 with exactly 2 rewriting 
rules in each membrane that generates the same language of G: 

n = {V, p., Mq, Ml, . . . , Mk, Mk+i, . . . , Mk+h+i^ 

(^O.Po)i {Rl^Pl)i • ■ ■ ; {Rk+h+ly Pk+h+l), 0) 



where 

- V = NiAN2A{Z, Z' ,Z”} U {(7,11 < i < h} U T, 

~ P — [o[i]m--[fc]^[fc-i-i]fc_|_i---[fe-i-/i-i-i]fc+?i+i]o 

— Mq = {Z ZXA\S — >■ XA is the rule of a matrix of type 1} 

- Ml,..., Mfc+/i+i are empty 

- Ro = 1^0,1 : ^ A(-)} U |ro,2 : Z' -)> Z' 

— Ra = {ra 1 : W — >■ ZY} U (ro, 2 : A — >■ x{ouf)}, 1 < a < ki, {ma ' {X — ?> 
y, A — >■ a;) is a type 2 matrix) 

- Rfj = {rp^i : A — A} U {r/3,2 : X — >• ZY (out)}, ki-\-l < (3 < k2, {mp : {X —>• 
Y, A — >■ f) is a type 3 matrix) 

— Rj = {r.y^i : X — >■ Ci\ U {r-,,^2 : A — >■ x{out)}, -I- 1 < 7 < fc, {m-^ : {X —>■ 
A, A — >• a;) is a type 4 matrix) 

R'ijj — ■ Rifi—k t B.,p—}„) C {r,^ 2 ■ G.,jj—}^ ^ (out) } ,kp 1 ^ ^ 

k + h — 1, 

~ Rk+h = {'k’k+h.l ■ Bh Bfi\ U {rk-\-h,2 ■ Ch ^ Z (out)}, 

^ Rk+h+1 = |?'/c+/t+i,i ■ Z — >■ A} U {rk+h+1,2 ■ Z — >■ X(out)} 

— Pi '■ fi^i > ri^2, for every 0<i<h + k+ l 

In other words, we place into the skin membrane several membranes with posi- 
tive charge, one membrane for each matrix in (7; each membrane simulates the 
productions of a matrix in G. Moreover, we place into the skin membrane several 
membranes with negative charge, one membrane for every nonterminal symbol 
in G, used to verify that the generated strings do not contain non terminal sym- 
bols. Finally we put one further membrane, with negative charge, to stop the 
computation. 

Consider a string of the form Z ZElw in membrane 0 with w G {N2UT)* and 
H G Ni (initially we have Z ZXA). We have to apply the production Z — ?> A(— ) 
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and we get the string Z Hw. This string is sent to a membrane with positive 
charge, in which we simulate the productions of a type 2, 3 or 4 matrix. 

Membrane simulating a type 2 matrix If the string is sent to a membrane 
corresponding to a type 2 matrix, we have to apply a rule of the form X — >■ ZY 
(which simulates the first production of the type 2 matrix) and a rule of the 
form A — >■ x{out) (which simulates the second production of the type 2 matrix). 
The first production of a matrix of type 2 in the binary normal form cannot be 
of the form X ^ X, as one can see from the description of the normal form in 
Thus, if the symbol X is in the string (i.e. H = X), we have to apply this 
rule and we can do it only one time (the string cannot evolve forever in this 
membrane due to a rule X — >■ X). Otherwise, we cannot apply this rule and 
the symbol Z is not reinserted in the string. Then, we have to apply the rule 
A — >■ x{out). If the symbol A is not in w, the string cannot further evolve and it 
will not reach the output membrane; otherwise the obtained string is sent back 
in the skin membrane. As said before, if the production X — >■ ZY has not been 
applied, we have now a string of the form Z Hw , i.e. a string without the symbol 
Z. Thus, in the skin membrane we have to apply the rule Z Z (+), which 
sends the string into a membrane with negative charge. It easy to see that there 
is no such a membrane which sends back the string in the skin membrane (the 
string does not contain the symbol Z nor a symbol Ci). The computation can 
correctly proceed only if the membrane correctly simulates the corresponding 
type 2 matrix. In fact, if we apply the production X — >■ ZY and if the symbol A 
is in w, we can apply the rule A — > x{put), that sends back in membrane 0 the 
string Z ZY w , that is ready to simulate another matrix. 

Membrane simulating a type 3 matrix If the string Z Hw is sent to a mem- 
brane of type 3, we have to apply the rules A — >■ A and X — >■ ZY. We have 
the following possibilities: if the string contains the symbol A, we have to apply 
forever the production A — >■ A (due to the priority), thus the computation will 
never stop and no string will be produced. If the symbol A is not in the string, 
we can apply the production X — >■ ZY{out). li H ^ X the string cannot further 
evolve and it will remain in this membrane forever, otherwise we correctly sim- 
ulate a type 3 matrix and the string is sent back in membrane 0, where we can 
start the simulation of another matrix. 

Membrane simulating a type 4 matrix If the string Z Hw is sent to a mem- 
brane of type 4, we have to apply the rules X — >■ Ci and A — >• x{out). If the 
string does not contain the symbol A, it will remain in the membrane forever. If 
the string contains the symbol A but it does not contain the symbol X, we can 
apply the rule A — >■ x{out). The obtained string does not contain the symbol Z 
nor the symbol Ci . As we have seen before for the type 2 matrix, this string will 
reach a membrane with negative charge but it will never exit from that, thus 
no string will be generated in this way. The matrix will be correctly simulated 
only il H = X and the string contains the symbol A. In this case, we first apply 
the rule X — > Ci and then the rule A — >■ x{put). In the skin membrane we get a 
string of the form Z C\w. 
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Thus, we are able to simulate the productions of every type of matrix in 
the correct order. When the string reaches a membrane that corresponds to a 
type 4 matrix, the phase of simulating the production of the matrices has to 
be ended, and we have to control that the obtained string does not contain non 
terminal symbols. This is done with the negative charged membranes. Consider 
a string of the form Z C\w in membrane 0. Obviously, the production Z — >■ A 
cannot be applied, because the string does not contain the symbol Z . Thus, we 
have to apply the rule Z ^ Z (+). The obtained string Z Ciw is sent to a 
membrane with negative charge. There is only one membrane with a production 
that involved Ci: the membrane used to control the presence of the non terminal 
Bi, that is the membrane k + 1 ; it contains the productions Bi — ^ B\ and 
Cl — >■ C 2 {out) . If the string reaches a different membrane, it cannot further 
evolve and no string reaches the output membrane. Otherwise, we can test the 
presence of the non terminal Bi in the string: if B\ is in the string, we have to 
apply forever the production B\ ^ B\, otherwise we can apply the production 
Cl — >■ C 2 {out) and we send back in membrane 0 the string Z C 2 W. Here we can 
apply again the rule Z Z {+) to send the string in a membrane with negative 
charge. 

Now, the “correct” one is the membrane that tests the presence of the non 
terminal symbol B 2 - If the string reaches another membrane, it will be blocked. 
If it reaches the membrane k + 2 and the string contains the symbol B 2 , the 
computation will never halt, otherwise we enter membrane 0 with the string 
Z C 3 W. The computation proceeds in this way until we test all non terminal 
symbols. The sequence Ci, C 2 , . . . , C/j permits us to be sure that we test the 
presence of all non terminal symbols. 

In the membrane k + h (which checks the presence of the h — th non terminal 
symbol) we have the production Ch ^ Z (out) {Z tell us that we have checked 
the presence of all non terminal symbols). The string is sent back in membrane 
0 where we have to apply again the rule Z ^ Z (+). The string Z Z w is, sent 
to a membrane with negative charge. If it reaches the membrane k + h + 1 we 
apply the rules Z ^ X and Z — >■ X{out) that sends back in membrane 0 (the 
output one) the terminal string w; otherwise the string will be blocked in one of 
the other membrane with negative charge. 

Thus, in the output membrane we get exactly the strings of terminal symbols 
generated by G, that is L{G) = L{II). □ 

As previously said, the system in the proof takes advantage of the polarization 
of the membranes and of the strings. The polarized strings are sent to membranes 
with an opposite charge (chosen in a nondeterministically way). This feature 
allows one to control the communication of the strings with only two general 
rules (in the skin membrane) , because we do not have to specify the label of the 
membrane where the strings have to go (as in the models presented in 0). With 
one rule we simulate a matrix and with the other we stop the simulation and 
we start the phase in which we control if the string is a terminal one. We can 
control if the string reaches a “wrong” membrane using the rules in the same 
membrane, as we have shown in the proof. 
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Note that the proof presented here does not show how to build a system in 
double_2_normal Jorm starting from a generic RP systems with priority (in a 
direct way). Of course, given a RP system, we can build an equivalent type 0 
grammar, from this we can build the equivalent matrix grammar with appear- 
ance checking and finally we can obtain an equivalent RP system in the normal 
form. It would be interesting to show how to obtain a system in the normal form 
in a direct way. 

4 Rewriting P Systems without Priority 

We consider now RP systems without priority and with External Output. This 
last feature were introduced in and the differences with respect to the model 
previously described are the following: 

— We do not collect the strings in an output membrane. Instead, we consider 
all the terminal strings which are sent out of the system at any time during 
the computation. 

— If a string leaves the system but it is not terminal, then it is ignored; if a 
string remains in the system then it does not contribute to the language 
generated, even if it is a terminal one. 

— We do not consider halting computations: we leave the process to continue 
forever and we observe the terminal strings which leaves the system; the 
language consists of all these strings. 

We show that such a system can be reduced to a system with three rules in each 
membrane. 

In the following we denote with RPE^{nPri,n5) the family of languages 
generated by Rewriting P systems with external output, which use electrical 
charges and which do not use priority relations nor dissolving membrane action. 

Definition 2. A P system is in 5_ normaLform if in each membrane we have 
exactly 3 rewriting rules. 

Theorem 2. . Every language L S RPE^ {nPri, nS) can be generated by a RP 
system of the same type in S^normaLform. 

Proof. Consider a RP system with external output 

n = {V, fi, Ml , . . . , Mk,R \, . . . , Rk) 

We denote with the i — th membrane, and we assume that the skin membrane 
is mi. 

We show how to construct a RP system 

n' = iV',yi',M[,...,Ml,,R[,...,R[,) 

(V = V U {Z, Zo, Z^, Zi, Z_|_, Z-}) in 3_normaLform, which generates the same 
language of II. 
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To explain how we build U , consider a membrane rrii of 77, with i yf 1. Such 
a membrane contains a set of string and a set of rules 7?^. Moreover there are 
a number of cells placed immediately inside this one, some with positive charge 
and others with negative charge. 

First of all, we replace every string w in rui with a string Zw, where Z is a 
symbol not in V. We will use this symbol and the symbols ^o, ^o' > -2^- 

to control the travel of the strings between the membranes. These symbols will 
be deleted before the string leaves the skin membrane, thus their position into 
the strings is of no interest (we need this later in the proof). 

Then we place a membrane around the cells placed immediately inside rrii, 
in such a way that the rules in rrii remain outside this new membrane while the 
membranes in m-i will be inside this new membrane. We label this membrane 
with Qi (to indicate the membrane Around the membrane in i) and we give it a 
positive charge. In the new region just created, we place the rules: 

— Z_|_ — >■ Z{+) , Z- — >■ Z{—) , Zg — >■ Z{out) 

Then, we consider the rules in 7?^. For every rule in Ri, we add a new membrane 
in the region defined by Ri and by the new membrane Oj, and we give negative 
charge to these membranes. If Ri contains q rules, we add q membranes. Each 
membrane will be used to simulate a rule. We denote these membranes with 
mvij, where 1 < J < (? (to indicate the Membrane added in i to simulate the 
Rule j). We have to simulate the rule and the travel of the obtained string 
too. For this reason, we get four different form of membrane (here, out, positive 
charge, negative charge). 

For every rule of the form A — >■ x(here) we add a membrane with the rules: 
A — y Zxi^out^ , A — ^ Z Ai^ovA) , A — y A 
For every rule of the form A — >■ x{out) we add a membrane with the rules : 

— A — )■ Zqx{^ovA) , A — )■ Z Ai^ovA) , A — )■ A 

For every rule of the form A — >■ a:(+) we add a membrane with the rules : 

— A — >■ ZiZ+x{out) , A — >■ ZA{out) , A —y A 

For every rule of the form A — >■ a:(— ) we add a membrane with the rules : 

— A — y ZiZ_x{^ 0 Vj£) , A — y ZAi^ovA) , A — )■ A 

No string is initially present in these new membranes. 

Note that in each membrane there is only one rule (the first one) that effec- 
tively simulates the corresponding rule in m-i. The two other rules are dummy 
rules introduced to get a system with exactly three rules in each membrane. As 
we will see later in the proof, they have no influence on the strings. 

Once we have created these new membranes, we delete all the rule in mi and 
we place in this membrane the rules: 

— Z — y A(-l-) , Zi — y A( — ) , Zq — y Zg(pui) 
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The membrane obtained in this manner is denoted by (to indicate that the 
membrane has a corresponding membrane nii in II, and it is not a new one). 

Thus, all membrane except the skin membrane are changed as described 
above. We modify the skin membrane in the same way with only one difference: 
the rule Zo —1 Z^{out) will not be added in rrii. Instead, we add the rule Zo — i- 
\{out). 

To see how the computation proceeds, consider a string w in a membrane 
TOj. Using a rule in we can replace a symbol in w to obtain a string w' . Then, 
w' can remain in the same membrane and it can be involved in a new rewriting 
rule or it can be sent to the membrane immediately outside or w' can be sent to 
a membrane immediately inside rrii (with positive or negative charge). 

In the corresponding membrane of II (i.e. in m^) we have the string Zw, on 
this string we can apply only the rule Z — i A(+); the obtained string w will be 
sent to a membrane mvij, where we simulate one rewriting rule and we define 
where the obtained string have to be sent: 

HERE If w reaches a membrane corresponding to a rule A — i x(here) and w 
does not contain the symbol A, the string will remain in that membrane forever. 
Otherwise, we have the following possibilities: 

— We can apply the rule A — >■ Zx(put) to simulate the corresponding rule in 
rrii. The obtained string is of the form w\Zw2, where W\W2 = w', and it 
will be sent outside to the membrane m^, where we can apply again the rule 
Z^X{+). 

— We can apply the rule A — >■ ZA{out). The obtained string is of the form 
W3ZW4, where W3W4 = w; it will be sent outside to the membrane m^. The 
only difference between this string and the string Zw is the position of Z 
but, as said before, it is not important for the computation. Thus, this rule 
has no effect on the string. 

— We can apply the rule A ^ A. The string is not modified; it can be used 
many times with this rule, until we apply one of the other two rules, to send 
the string back in membrane m^. 

OUT If w reaches a membrane corresponding to a rule A — >■ x{out) and w does 
not contain the symbol A, the string will remain in that membrane forever. 
Otherwise, we have the following possibilities: 

~ We can apply the rule A — >■ Zox{out) to simulate the corresponding rule in 
rrii. The obtained string is of the form wiZoW2, where W\W2 = w' , and it will 
be sent back to membrane m^. In membrane we have two possibilities: 

• If f 1 (i.e. TOj is not the skin membrane), we have to apply the rule 
Zo — >■ Zo{out). We send the string w\ZoW2 to the membrane immediately 
outside Wj, which is a membrane with a label a-u {v is the label of the 
membrane immediately outside the membrane i in 77). The membrane ay 
is one of the membrane added in 77 , originally not in 77. In membrane 
Uy, we can only apply the rule Zo — >■ Z(out), which sends the string 
w\Zw2 to the membrane outside, which is the membrane m„. Thus, we 
correctly simulate the application of the rule and the obtained string 
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is sent to the correct membrane, where we can start the simulation of 
another rule. 

• If i = 1 (i.e. rUj is the skin membrane), we have to apply the rule Zo — > 
\{out). The symbol Zo is deleted and the obtained string is sent outside 
the system; thus, if the string is a terminal one, it will be in the language 
L(TT'). 

— We can apply the rule A — >■ ZA{out) or the rule A ^ A. The consideration 
for the HERE case is still valid. 

POSITIVE CHARGE If w reaches a membrane corresponding to a rule A — 
a;(+) and w does not contain the symbol A, the string will remain in that mem- 
brane forever. Otherwise, we have the following possibilities: 

— We can apply the rule A — )> ZiZ^x{out) to simulate the corresponding rule 
in rrii. The obtained string is of the form wiZiZ^W 2 , where W\W 2 = w' , 
and it will be sent outside to membrane m^. Here we delete the symbol Zi 
with the rule Zi — > A(— ). The string will be sent to the membrane with 
positive charge in m^, i.e. the membrane Oi. In at we can apply only the rule 
Z-|_ — >■ Z(+). The obtained string wiZw 2 will be sent to a membrane with 
negative charge, and the simulation of the rule A — >■ a::(-|-) is correctly done. 

— We can apply the rule A — >■ ZA{out) or the rule A ^ A. The consideration 
for the HERE case is still valid. 

NEGATIVE CHARGE This case is similar to the previous one. 

Consider now the case that no rule in rrii (membrane of II) can be applied to 
w. The string will remain blocked in rrii forever, thus no string will be generated 
from it. In the corresponding membrane (membrane of 7T ) we have the 
string Zw. We can apply the rule Z — ^ A(-l-) and the string w will be sent 
to a membrane mrij. Because no rule in rrii can be applied to w, there is no 
membrane mrij with a rule that can be applied to it (the left symbol of the 
productions in mrij is the left symbol of a production in rrii), so the string w 
will remain blocked forever in the membrane. 

Another possibility is that the computation on w in 7T never halts, thus no 
string will be generated from w. In 77 we have two possibilities: the computation 
can continue forever by simulating the same rules or it can end, when the string 
w reaches a membrane mra,b with no rules applicable on w. In both cases no 
string will be generated from Zw. 

Thus, the strings that leave the skin membrane in 77 are the same of those 
in 77, that is L{II) = L{II ). □ 

5 Conclusions 

As pointed out in PQ and ^ the theory of computing with membranes misses 
basic tools (necessary conditions and normal form theorems) for producing ex- 
amples and counterexamples. 

We have made a little step in this direction by presenting in this paper two 
normal forms about Rewriting P-systems. The goal is to obtain simple systems 
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that are easy to analyze, in order to understand their generative power and their 
equivalence with other generative mechanisms. For instance, we have shown that 
every RP system of depth two and with two rules in each membrane can generate 
every RE language, by showing that such a system is equivalent to a Matrix 
Grammar System with Appearance Checking; moreover, we have shown that 
every RP system without priority is equivalent to a RP system with exactly 
three rules in each membrane. It is not known if RP systems without priority 
are able to generate RE language or not; we hope that the normal forms given 
here can be useful to give an answer to this question. 
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Abstract. A configuration of a Turing machine is given by a tape con- 
tent together with a particular state of the machine. Petr Kurka has 
conjectured that every Turing machine - when seen as a dynamical sys- 
tem on the space of its configurations - has at least one periodic orbit. In 
this paper, we provide an explicit counter-example to this conjecture. We 
also consider counter machines and prove that, in this case, the problem 
of determining if a given machine has a periodic orbit in configuration 
space is undecidable. 



1 Introduction 

A Turing machine is an abstract deterministic computer with a finite set Q of 
internal states. The machine operates on a doubly-infinite tape of cells indexed 
by an integer % K symbol taken from a finite alphabet S is written on every 
cell; a tape eontent can thus be seen as an element of 

At every discrete time step, the Turing machine scans the cell indexed by 
0 and, depending upon its internal state and the scanned symbol, the machine 
either has no corresponding action, or performs one or more of the following 
operations: replace the scanned symbol with a new symbol, focus attention on 
an adjacent square by shifting the tape by one unit, and transfer to a new state. 
A Turing machine M can thus be given by its transition function 5 m : Q x A — >■ 
Q X E X {—1,0, 1}. A tape content together with an internal state constitute a 
configuration of the machine. A Turing machine thus defines a (partial) function 
f : Q X Q X A^ on its configuration space C = Q x A^. 
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the Belgian program on interuniversity attraction poles lUAP P4-02, and by the 
French Ministry of Education and Research. 
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This is actually one of several possible models of the Turing machine (TM), 
namely the TM with moving tape. Another model can be obtained by making 
the head scan a cell of any index (the tape being immobile), and by encoding 
the head position in the configuration. This is a TM with moving head, and the 
space of configurations has a different topology. We return to this issue in a few 
paragraphs. 

We look at computing machines as dynamical systems on configuration space 
and look at the possible types of trajectories. This approach contrasts with the 
computability one, where the configurations are finite objects (the tape content 
is finite and the rest of the tape is filled with “blanks” ) and where we start the 
process from a precise initial configuration. 

Let / be the function defined by some Turing machine M on its configuration 
space and let c be some configuration of M. The configuration c' = /(c) is the 
successor of c. If / is not defined on c, then c has no successor and is said to 
be terminal. Denote by /* : C — >■ C the t-th iteration of /. A configuration c is 
halting (or eventually terminal) if /*(c) is terminal for some t > 0, it is periodic 
if f*{c) = c for some t > 1, and it is eventually periodic if /*(c) is periodic for 
some t > 0. Configurations that are not halting nor eventually periodic are said 
to be wandering. Thus, a configuration is either halting (1), eventually periodic 
(2), or wandering (3); see Figure 1. 



Fig. 1. The three possible types of configurations: 1. halting, 2. eventually periodic, 
and 3. wandering 

It is clear that a machine has a periodic configuration if and only if it has an 
eventually periodic configuration, and this condition is again equivalent to that of 
the existence of a configuration that leads to periodic sequences of tape contents. 
Little is known about the possible combinations of configurations types defined 
by Turing machine. In |^, Petr Kurka has proved that Turing machines (when 
endowed with a suitable topology) do not have attracting periodic orbits. In the 
same reference, Kurka conjectures the following (see also for a discussion of 
this conjecture): 

Conjecture (Kurka, 1997): A Turing machine that has no halting configura- 

tion has a periodic configuration. 

In this paper, we analyze Kurka’s conjecture and questions related to it. For 
the Turing machine with moving head defined above there is a simple counter- 
example, the machine writing 0 . . . 01, with an increasing number of 0, which 
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answers the question for this model. From now on we study Turing machines with 
moving tape. We first consider counter machines rather than Turing machines. 
A n-counter machine with state set Q can be seen as a dynamical system on the 
configuration space C = Qx N" . The 1-counter machine that keeps incrementing 
its unique counter has no periodic configuration and therefore constitutes an easy 
counter-example to a statement analogous to Kurka’s conjecture for counter 
machines. In Section 2, we prove the stronger result that, in the case of counter 
machines, the problem of determining if a given machine that has no halting 
configuration has a periodic configuration is undecidable. 

In Section 3, we consider the Turing machine model. We provide an explicit 
construction of a Turing machine that has no halting configuration nor periodic 
configuration, thus disproving Kurka’s conjecture. The machine we construct has 
36 states and operates on an alphabet of four letters. 

2 Periodic Configurations for Counter Machines 

A n-counter machine is an abstract deterministic computing machine with a fi- 
nite set Q of internal states and a finite number of registers i?i , . . . , containing 
nonnegative integers. The register values together with the internal state of the 
machine constitute a configuration of the machine. The configuration space of 
counter machines is thus given by C = N" x Q. Depending upon its internal 
state and whether the registers are equal to 0, a machine can perform one of 
the following operations: leave the registers unchanged, increase some register 
Rj by 1, or decrease some register Rj by 1 (assuming Rj yf 0), and move to a 
new internal state k. The transition function of the counter machine is defined 
through instructions. Instructions are tuples 

[q,bi,...,bn,j,D,k] 

where q € Q represents the present state, bi G {true, false} says whether the 
register Ri is equal to 0, j is the index of the register which is modified by the 
instruction, D G {—1,0, -1-1} is the operation affecting Rj, and k G Q is the new 
internal state. For consistency, no two tuples begin with the same n-|-l symbols. 
This definition of a counter machine is slightly different from that given in ^ 
but is easily seen equivalent in terms of computational power. 

As explained in the Introduction, it is easy to construct counter machines 
that only have wandering configurations. Consider for example the 1-counter 
machine with one state, that keeps incrementing its unique register. This machine 
is defined by the two instructions 

[1, true, 1, -1-1, 1] 

[1,/a^se, l,-kl,l] 

This machine has no halting nor periodic configuration; all its configurations 
are wandering. By adapting the proof of Theorem 1 in [21, we prove that dis- 
tinguishing the counter machines that have a periodic configuration from those 
that have not such a configuration cannot be done algorithmically. 
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Theorem 1 . Let M be a eounter maehine that has no halting eonfiguration. The 
problem of determining if M has a periodic configuration is undecidable. This 
problem is undecidable even in the case of 2-counter machines, but is decidable 
for 1-counter machines. 

Proof. The proof is by reduction from the classical halting problem for counter 
machines; see Consider a counter machine M with m internal states la- 
beled qi,q2, . . . ,qm, n registers R\, . . . , Rn and let s = (ri, r2, . . . , r„, gj) be a 
given configuration of M. The instructions of M are, as said before, of the form 
[g*, 61,62, ■ • ■,bnJ,D,qk]- 

To establish the first part of the result we describe how to effectively construct 
a counter machine M' that has no halting configuration, that has n-\-2 registers 
Ri, . . . , Rn,V, W, and that has a periodic configuration if and only if M halts 
on s. 

The machine M' has a special state denoted by qo. Each time when it enters 
the state q^, M' executes a sequence of instructions whose effect is to store 
in Ri, for all f, 1 < i < n, 2 max(l,V) in W and 0 in V. After having done 
this, the machine moves into state g* and from there moves into state qi. (The 
intermediate state g* is only introduced to facilitate the exposition of the proof.) 

Then the machine starts a simulation of the machine M . The simulation 
is such that, before performing any of the instructions of M, the machine first 
increases the value of the register V by 1 , then tests the value of W. If the value of 
W is equal to 0 it returns to the special state qo, otherwise it decreases this value 
by 1 and performs the instruction of the machine M . Thus, the instructions of 
the machine M, which are of the form [qi, 61, 62, ... , 6„, j, D, qk], are all changed 
into twelve instructions for M'\ 

[qi,bi,b2, . . . ,6„,6*+i,6*+2,?t-+ 

[g',61,62, . . . ,6„,6*+i,trMe,n-h2,0,go] 

[g', 61, 62, . . . , 6„, 6;+i, /oZse, n -h 2, -1, q'/] 

[g",6i,62,. . . ,6„,6*+i,6*+2,i,-D,g/c] 

where 6*_|_i and 6*_|_2 range over all four possible combinations, that is 
bn+2 G {true, false}. We complete the construction of M' by adding jumps 
to go from all terminal configurations. 

The machine M' we have constructed has no halting configuration. We claim 
that it has a periodic configuration if and only if M halts on s. 

In order to prove our claim, assume first that M halts on s and let k be the 
number of steps after which it halts. Consider the machine M' at configuration 
c = (ri, r2, . . . , r„, 0 , 2k, g*). Before every step simulation of M, the value of the 
register V is increased by 1 and that of W is decreased by 1 . After k such steps, 
the values of the registers V and W are both equal to k and the machine M' 
jumps to go with the value of V equal to k. From there it is easily verified that 
M' returns to c, and so c is a periodic configuration. 
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For the reverse implication, assume that M' has a periodic configuration. 
We first observe that all trajectories in configuration space pass infinitely many 
often through configurations of the form (ri, r 2 , . . . , 0, 2k, g*) for some fc > 1. 

Indeed, the register W is regularly decremented when executing instructions of 
M' . It is therefore clear that, whatever configuration the machine M' starts from, 
either the machine M' reaches a terminal configuration of M, or W reaches 0 
after finitely many steps. In both cases, M' then jumps to q^, executes a sequence 
of instructions whose effect is to store in i?^, 2 max(l, V) in W, and 0 in 1/ and 
finally moves to g*; thus leading to the configuration (ri, r 2 , . . . , r„, 0, 2k, g*) for 
some A: > 1. 

From this observation we conclude that, if M' has a periodic configuration, 
then it must have one of the form c = (ri, r 2 , . . . , r„, 0, 2k, g») for some fc > 1. 
But a configuration of this type can only be periodic if the machine M halts on s 
after fc steps. Indeed, if M does not halt on s, then the value of 2V + W regularly 
increases (it increases by 1 on each simulation step, and remains unchanged on 
other steps) and c is not periodic. Hence the result. 

For 2-counter machines the proof uses the prime number encoding, and for 
1-counter machines, the presence of a cycle is equivalent to the presence of a 
cycle where the value of the register is bounded by the number of states. □ 

3 A Counter-Example to Kurka’s Conjecture 

We now describe our counter-example to Kurka’s conjecture. As a starting point, 
consider the Turing machine Kq represented on Figure 0 




Fig. 2. The Turing machine Kq. The machine is defined by a labelled oriented graph. 
The vertices represent the states of the machine, and the labels on the arcs are used 
to define the transition function. For example, the label lOA on the arc from state 1 
to state 3 means that, when in state 1 and reading the symbol 1, the machine writes 
the symbol 0 and shifts the tape one unit to the left, in order to read the symbol to 
the right of the 0 it has just written, then goes into state 3. 



The machine has three states and operates on the two-letter alphabet {0, 1}. 
When in state 1, the machine searches for a 1 to the right of the head. Once it 
has found one, it changes it into a 0, writes a 1 to the right of it, and moves 
to state 3 from which it starts a left search for a 1. Once it has successfully 
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completed its search, it returns to state 1. It is easy to verify that this machine 
doesn’t have any periodic configuration, except for the configurations for which 
the machine is in its left or right search state (state 1 or 3) and the tape is a 
tape of O’s. Machines that have search states do in fact always have periodic 
configurations whose associated tape is a periodic tape of symbols that do not 
satisfy the search. It is therefore clear that in constructing a counter-example we 
need to eliminate all search states. The machine we construct below essentially 
performs the same operations as those of Kq, except that it does so by bounding 
its searches. The technique we use for replacing searches by bounded searches is 
adapted from 0. 

The Turing machine Ki we construct has 36 states and operates on the 
four-letter alphabet {0, 1, 2, 3}. The machine is rather involved, and we show on 
Figure 0 only half of the machine; the other half is symmetrical, with L and R 
being interchanged. In order to describe the behavior of the machine, we have 
grouped the states in six groups denoted 1, 2, 3, 1', 2', 3'. Each group i has a 
particular functional purpose, has a unique entry state, qn^ and has two exit 
states: the failed search state and the dispatch state qiQ. 

The group 1 has the same function as the state 1 of Kq (search right for a 
non-zero symbol), the group 2 has the same function as the state 2 of Kq (search 
left for a non-zero symbol), and the state qiQ corresponds to state 3. In order to 
bound the searches without introducing periodic configurations we introduce a 
third group of states (group 3) that has no counterpart in Kq and construct a 
symmetric set of states. 

The technique for bounding the searches consist in using the searched zone 
on the tape as a stack. The stack is used to push the group index of failed 
searches. The corresponding pop operation is performed by the groups 3 (for 
right searches) and 3' (for left searches). The pop is a search operation as well, 
thus a push mechanism for it is also set in place. The states qij with j = 1, 2, 3 
perform the bounded search. The states qij with j = 4, 5 write the 1 to be moved 
by the new search called from qi^, and push the index of the failed search. The 
return from the new search to the calling search, when the one-unit move fails, 
is made from gig (respectively qve) by going to q^i (respectively gs'i)- 

In the sequel, configurations of the machine will be given by expressions 
of the form (“0010100000*^,511). This configuration for example is the one of 
tape content ‘*'0010100000**^, internal state qu, and with the head scanning the 
underlined symbolEl Note that, although we use here the usual notation for 
Turing machines with moving head, this is only for convenience and our model of 
Turing machines remains the moving tape model. Consequently, the underlined 
position always has index 0 and expressions such as (**'0010100000**', gn) and 
(**'0000101000**', gii) represent the same configuration. As an illustration, the 
sequence of configurations obtained when starting from the state gn on a tape 
of O’s is as follows: 



^ For n > 0, the notation 0" is used to denote a sequence of n O’s. We use the notation 
O*** to denote an infinite sequence of O’s. 
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mark tape content 


group 


state 


purpose 


“0000000000“ 


1 


911 


search (like state 1 of Kq) 


“0000000000“ 


1 


9l2 


search 


“0000000000“ 


1 


913 


search and fail 


“0000100000“ 


1 


914 


prepare the new search 


“0000100000“ 


1 


915 


push the index 


* “0010100000“ 


1 


911 


call a right search 


“0010100000“ 


1 


9l2 


search and find 


“0010000000“ 


1 


9l6 


erase and move (like state 3 of ATo) 


“0010010000“ 


2 


921 


go back in order to bounce 


“0010010000“ 


2 


922 


search (like state 2 of Kq) 


“0010010000“ 


2 


923 


search and find 


“0010010000“ 


2 


926 


bounce and jump to right search 


* “0010010000“ 


1 


911 


search right again (like state 1 of itTg) 


“0010010000“ 


1 


9l2 


search 


“0010010000“ 


1 


9l3 


search and find 


“0010000000“ 


1 


916 


erase and move 


“0010001000“ 


2 


921 


go back 


“0010001000“ 


2 


922 


search 


“0010001000“ 


2 


923 


search and fail 


“0011001000“ 


2 


924 


prepare a new left search 


“0011001000“ 


2 


925 


push the index 


“0011021000“ 


1' 


91'1 


call a left search 


“0011021000“ 


1' 


9l'2 


search and find 


“0010021000“ 


1' 


9l'6 


erase, but fail to move 


“0010021000“ 


3' 


93' 1 


thus, pop the index 


“0010021000“ 


3' 


93'2 


by searching for it 


“0010021000“ 


3' 


93'3 


search and find 


“0010021000“ 


3' 


93'6 


read and prepare to return 


“0010001000“ 


2 


921 


pop, and return to left search group 2 


“0010001000“ 


2 


922 


resume the left search 


“0010001000“ 


2 


923 


search and find 


“0010001000“ 


2 


926 


bounce and jump to the right search 


* “0010001000“ 


1 


911 


restart the right search 



In this example, we see in the ^-marked lines that the machine passes through 
the configurations (“0100”10“, gn) for n = 0, 1 and 2. In the next section 
we prove that the machine will then pass through configurations of the type 
(“0100"10“, (jii) for increasing values of n, so that the initial configuration 
(“000“, <7ii) is not periodic; and more generally, whichever configuration it starts 
from, the machine passes through configurations where the head is at the be- 
ginning of increasingly larger blocks of zeros, and so the machine may not have 
periodic configurations. 

We now prove that the machine Ki doesn’t have periodic configurations. Due 
to the symmetry of the machine, the proofs will only be detailed for one half of 
the configurations, the other being inferred by symmetry. We first need auxiliary 
definitions and results. 
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Fig. 3. The Turing machine Ki. Details are given only for half of the machine; the 
other half is symmetrical. L, R and N mean, respectively, right, left and no tape shift. 
We use the symbol * to denote an arbitrary symbol of {0, 1, 2, 3}, and X to denote a 
symbol from {1, 2, 3}. 
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Definition 1. For s > 0, let Q{s) be the proposition: 

“The machine goes from any configuration of the form (. . . 00®Xy . . . ,qn), 
with X G {1,2,3} and Y G {0,1, 2, 3}, to configuration (. . . , qie), 

where the parts of the tape represented by .. . remain unchanged. For i = 2 and 
i = 3, the machine goes from any configuration of the form (. . . . . . , qn) to 

configuration (. . . . . , qio). The symmetric statements also hold.” 

For k >2, let P{k) be the proposition: 

“For any integers t,p,n > 0 such that t + p + n + 2 = k and for any 
X,Y, Z G {1, 2, 3} the machine goes from configuration (. . . XO^QQPZO'^Y . . . , gn) 
to configuration (. . .Xf)^Y . . . , q^e). The symmetric statement also hold. ” 



Proposition 1. Let s > 0. If P{k) is true for all integers k with 2 < k < s, 
then Q{s) is true. 

Proof. The proof is by induction on s. The cases s = 0 and s = 1 can be checked 
by hand by following the machine’s diagram on Figure 0 

Then, to prove the proposition by induction on s it is sufficient to show that 
{Q{s — 1) and P{s)) implies Q{s). We have the following configuration sequence: 



oooo®-2xy . . . 


<7ii 


i 


A few iterations 


ioio®"^xy... 


<7ii 


s 

i 


P{s) (hypothesis) 


iooo®-2xy... 


<736 


i 


One iteration 


oooo®-2xy . . . 


<7ii 


i 


Q{s — 1) (hypothesis) 


.0000®-20F... 


<736 



The proof for the configurations (. . . 00®X . . . , g 2 i) and (. . . 00®X . . . , q^i) is 
similar. □ 



Lemma 1. P{k) is true for all k >2. 

Proof. Define P{k,n) to be the proposition P{k) where n is also fixed. Let 
L = {{k,n) I n > 0,n + 2 < fcj and consider the lexicographical order <^, i.e., 
(k',n') <i (k,n) when k' < k, or when k' = k and n! < n. The order is 
well-founded, therefore allowing us to prove the lemma by induction. 

Fix (fc,n) from L and assume P{k',n') true for all {k',n') G L such that 
{k',n') <L (k,n). Note that this implies, in particular, that P{k') is true for all 
k' < k and so, by Proposition 2, Q{s) is true for all s < /c — 1. 

Case n = 0 By using Q{p), one iteration and then Q(k — 1), we get from 
(. . . X0>=-P-^00PZY . . . , gii) to (. . . XO'^-P-^OOPOY . . . , q^e)- 
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Case n > 0 Again, by using Q{p), one iteration, Q{k — 1 — n), one iteration, and 
then P{k, n — 1), we get from (. . . , qn) to 

(. . . A00^“”“P“^0^000"“^y . . . , 936 )- In both cases P{k,n) holds. This con- 
cludes the proof. □ 

Putting together Proposition 2 and Lemma 1 we immediately conclude: 

Lemma 2. Q{s) is true for all s > 0. 

We now prove that the machine doesn’t have periodic configurations of a 
particular type. 

Lemma 3. The configurations (. . .OO^,^^) with i G {1,2', 3'} and j G {1,2,3} 
are not periodic. This is also true for the symmetric case. 

Proof. Starting from the configuration (. . .00“,9ij) for some i G {1,2', 3'} and 
j G {1,2,3}, the machine goes to configuration (. . . A010‘^,gn) where, depend- 
ing on *, A = 1, 2 or 3. This configuration is of the type (. . . A00^y0‘^, gn) for 
some X,Y G {1, 2, 3}. Using Q{p), one iteration, Q(p+ 1), and another iteration, 
we get from (. . . AOO^POO^, gn) to (. . . A00^010“, gn). By iteratively applying 
this result, we extract from the configuration sequence obtained a subsequence 
of configurations of the type (. . . AOO^IO*^, gn) with strictly increasing values of 
p. Therefore the sequence is not periodic. □ 

In order to prove our main theorem, we need one more lemma. 

Lemma 4. Started in configuration (. . . 00"“^ . . . , g^) or (. . . . . . , gn), 

with n>l,iG {1,2', 3'} and i! G {l',2,3}, the machine goes to a configuration 
(. . .00” . . . ,gji) or (. . .0”0 . . . ,gjvi), for some j G {1,2', 3'} or f G {l',2,3}. 
Here, contrarily to definition^ the parts of the tape represented by .. . can be 
modified in the process. 

Proof. We only consider the cases (. . . 00"“^ . . . , g^i) for i = 1, 2', 3'. Let X be 
the symbol on the tape to the right of 00”“^. If A = 0, we are already in a 
configuration (. . . 00” . . . , g^i). Let us now assume that A G {1, 2, 3}. There are 
three cases to consider. Let Z and T denote arbitrary symbols. 



Case z = 1 . 


. .00 ”-iAZ... 
i 


911 


line lA 
Q(n - 1) 




..00”-i0 y... 


916 


one iteration 




..00”-^01... 


921 


if y = 0 


or . 


..00”-i0 Z... 


931 


if y G {1,2,3} 


Case z = 2' . 


. . yroo”-2oA . 




g 2 'i line IB 
Q(n - 1) 




. . 2T00”-20A . 

i 




92'6 

one iteration 




2T00”-20A . 




9i'i 



one iteration 
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If T 7 ^ 0, this is exactly symmetrical to line lA.If T = 0, we are in the desired 
configuration (. . . 0”0 . . . , 



ZTOO^-^OX . . 


■ 93' 1 








^ — 1 
1 


2T00”-20A . . 


■ 93'6 




i 




one iteration 


ZT00’"-200 . . . 


Qki 





If T = 0 we are done, so let us assume T G {1,2, 3}. The value of k depends 
on X. If X = 1 (respectively, X = 2), then k = V (respectively, k = 2) and we 
conclude with the symmetric of line lA (respectively, line IB). If A = 3, then 
k = 3, and the sequence continues (using Q{n — 1)) to (. . . Z000”“^00 . . . , qji) 
for some j (depending on T), which concludes the proof of the lemma. □ 

We can now finally prove: 

Theorem 2. The machine K\ has no periodic configuration. 

Proof. Consider an arbitrary configuration c. After at most 4 steps, the ma- 
chine enters a state qn for some i G (1, 2, 3, 1', 2', 3'}. A brief case analysis 
shows that if the scanned symbol is non-zero, after at most 4 other steps, no 
matter the symbols around the scanned one, the machine returns in a state 
qn with a zero as scanned symbol. Then we are in the situation described in 
Lemma 0 and for all n the machine will reach a configuration of the form 
(. . . 00 "“^ . . . ,qn) with i G (1, 2', 3'}, up to symmetry. Either at some point the 
machine will reach a configuration of the form (. . . 00“, g^i), up to symmetry, 
which by Lemma 0 is not periodic; or we can extract an infinite sequence of 
configurations (. . . 00”“^A . . . , qn), up to symmetry, with A yf 0 and increasing 
n, so that the initial configuration is not periodic. □ 

Let us notice that while the machine Ki has 36 states and operates on a 
four-letter alphabet, the construction can be changed so that it uses either only 
two letters with more states, or only three states with more letters (by using a 
result from 0). 

4 Discussion 

Various models of computing devices (Turing machines, counter machines, cel- 
lular automata, recurrent artificial neural networks, etc.) are equivalent in terms 
of computational power when they are seen as defining functions from input to 
output. In this paper, we consider the dynamics of some computing devices. We 
do not just look at the relations between inputs and outputs, but also look at 
what happens in between. We also look at configurations that do not correspond 
to a valid input (e.g., infinite tape content). The answer to the question we con- 
sider in this context (the existence of periodic configurations) highly depends 
on the chosen computing model. For example, recurrent neural networks always 
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have a periodic configuration (a periodic configuration is given by the fixed point 
for which all activation levels are set equal to 0). For these networks one can 
also infer from Theorem 1 of P the existence of networks that have no other 
periodic configuration. The situation for cellular automata is quite different. For 
a given cellular automaton, there are only finitely many space-periodic configu- 
rations of a given period. Space-periodicity is preserved through iteration, and 
so all space-periodic configurations of cellular automata are periodic and cellu- 
lar automata always have infinitely many periodic configurations. In the case of 
Turing machine with moving tape, we have shown that, contrarily to what was 
conjectured, not all Turing machines have a periodic configuration. Finally, in 
the case of counter machines we have proved that identifying the presence of 
a periodic configuration is an undecidable task. These few examples show how 
rich and different the dynamics of these computing devices are. 



Acknowledgment 

The authors are grateful to Professor Maurice Margenstern for his suggestions 
of improvement. 



References 

1. V. D. Blondel, O. Bournez, P. Koiran, and J. N. Tsitsiklis. The Stability of Saturated 
Linear Dynamical Systems is Undecidable. Proceedings of STAGS 2000, H. Reichel, 
S. Tison (Eds), Lecture Notes in Computer Science, Springer Verlag, Heidelberg, 
1770 (2000), 479-490. Full version to appear in Journal of Computer and System 
Scienees. 

2. V. D. Blondel, O. Bournez, P. Koiran, Ch. H. Papadimitriou, and J. N. Tsitsiklis. 
Deciding Stability and Mortality of Piecewise Affine Dynamical Systems. To appear 
in Theoretical Computer Science. 

3. P. K. Hooper. The Undecidability of the Turing Machine Immortality Problem. 
The Journal of Symbolic Logic, 31(2) (1966), 219-234. 

4. J. E. Hopcroft and J. D. Ullman. Formal Languages and Their Relation to Automata, 
Addison- Wesley, 1969. 

5. P. Kurka. On Topological Dynamics of Turing Machines. Theoretical Computer 
Science, 174 (1997), 203-216. 

6. C. Moore. Finite-Dimensional Analog Computers: Flows, Maps, and Recurrent Neu- 
ral Networks, in Proc. of the First International Conference on Unconventional 
Models of Computation, Auckland, New Zealand, 1998. 

7. C. Moore. Generalized Shifts: Undecidability and Unpredictability in Dynamical 
Systems. Nonlinearity, 4 (1991), 199-230. 

8. Y. Rogozhin. Undecidability of the immortality problem for Turing machines with 
three states. Cybernetics (Kibernetika), Kiev (USSR) 1 (1975), 41-43. 



On the Transition Graphs of Turing Machines 



Didier Caucal 

IRISA-CNRS, Campus de Beaulieu, 35042 Rennes, France 
caucal@irisa.fr 



Abstract. As for pushdown automata, we consider labelled Turing ma- 
chines with e-rules. With any Turing machine M and with a rational set 
C of configurations, we associate the restriction to C of the e-closure of 
the transition set of M. We get the same family of graphs by using the 
labelled word rewriting systems. We show that this family is the set of 
graphs obtained from the binary tree by applying an inverse mapping 
into F followed by a rational restriction, where F is any family of recur- 
sively enumerable languages containing the rational closure of all linear 
languages. We show also that this family is obtained from the rational 
graphs by inverse rational mappings. 



1 Introduction 

The transition graphs of some classes of machines have already been investigated. 
First, Muller and Schupp have considered rational restrictions of the transition 
graphs of pushdown automata: these graphs are the graphs of bounded degree 
having a finite number of non isomorphic connected components when decom- 
posed by distance from any vertex ; these graphs have a decidable monadic theory 
ng. This graph family is also the set of rational restrictions of the prefix transi- 
tion graphs of finite labelled word rewriting systems [Z1 • Extending to recogniz- 
able labelled rewriting systems, the rational restrictions of their prefix transition 
graphs define a larger family of graphs having a decidable monadic theory jS|. 
To extend this last family, Morvan has defined the family of rational graphs, 
which are the graphs recognized by transducers with labelled outputs This 
family is general: it contains for instance the transition graphs of Petri nets, the 
transition graphs of congruential systems (HI, and the transition graphs of la- 
belled word rewriting systems. Although the rational graphs are recursive, they 
have in general an undecidable first order theory. 

There is a simple and uniform way to present all the previous families of graphs 
from families of languages. Take a family F of languages, a set T of labels and a 
mapping h : T — >F associating to each label a language in F. The inverse image 
h~^{G) of a graph G hy h has an arc s t when there is a path s t in 

G for some word u in h{a). A family F of languages induces a family REGp of 
graphs obtained from the infinite binary tree by marking rationally some ver- 
tices (with a special letter) and then applying an inverse mapping. Then the 
family of rational restrictions of the transition graphs of pushdown automata 

M. Margenstern and Y. Rogozhin (Eds.): MCU 2001, LNCS 2055, pp. 177-|1^^ 2001. 
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is the family RECpin of graphs induced by the family Fin of finite languages. 
Furthermore, the family of rational restrictions of the prefix transition graphs of 
recognizable rewriting systems is the family RFC Rat of graphs induced by the 
family Rat of rational languages 0. Finally, the family of rational graphs is the 
family RFCj^^^ of graphs induced by a subfamily Lin of linear languages HI 

Thus small families of languages induce large families of graphs. Conversely, a 
family of graphs yields a family of languages. A trace of a graph is the language 
of path labels from and to given finite vertex sets. The traces of finite graphs 
are the rational languages. The traces of graphs in RFCpin are the context-free 
languages which are also the traces of graphs in RFC Rat ■ Finally, the traces of 
rational graphs are context-sensitive languages but the converse is a conjecture: 
is any context-sensitive language the trace of a rational graph ? 

Following the Chomsky hierarchy, we present a general family of graphs, the 
traces of which are the recursively enumerable languages. We consider the off- 
line Turing machines investigated by Mateescu and Salomaa m with a read only 
one way input tape and a unique two ways working tape. These machines are 
particular labelled word rewriting systems allowing rules labelled by e. Follow- 
ing the work of Payet PI and as for prefix transition graphs of word rewriting 
systems, we consider rational restrictions of the e-closure of transition graphs 
of these off-line Turing machines. We show that this family of graphs coincides 
with the family of rational restrictions of the e-closure of the transition graphs 
of word rewriting systems. We also show that this graph family is equal to 
= RFCre meaning that it is induced by any language family be- 
tween the rational closure of Lin and the family RE of recursively enumerable 
languages. Finally, we show that this graph family is obtained by inverse rational 
mappings of rational graphs. 



2 Preliminaries on Graphs 

Let P be a subset of a monoid M, and Idp = { (u, u) | m S P } the identity 
relation on P. A (simple oriented labelled) P-graph G is a subset of VxPxV 
where V is an arbitrary set. Any (s, a, t) of G is a labelled arc of source s, of target 
t, with label a, and is identified with the labelled transition s t or directly 

G 

s A4 t if G is understood. We denote by Vg := { s | 3 a 3 t, s t V t s } 
the vertex set of G. The set 2^^^ of P*-graphs with vertices in P is a 
monoid for the composition GoH := { r t I 3 s, r s A s — ^ t } for 

G H 

any G,H C VxP*xV, where { s — U- s | s G P } is its neutral element. The 
submonoid {G}* of generated by any graph G gives by union the 

graph G* := IJ{G}*. The relation denoted by or simply by if G is 

G* G 

understood, is the existence of a path in G labelled u G P*. For every Q C P*, 
we write s t if there is some u G Q such that s => t. The restriction G\c of a 
P-graph G to an arbitrary set G is G\c '■= Gfl (GxPxG). The labels T(G, E, F) 
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of paths of G from a set if to a set F is the set L{G, E,F) :={ u & M \ 3 s & 
E, 3 t £ F, s t }. A trace of a graph G is the language L{G, E, F) of path 

labels from a finite set if to a finite set F. 

Given an alphabet T and a relation h C TxP, the inverse h~^{G) by h of any 
P-graph G is the following T-graph: 

h~^{G) := { s t \ a £T A 3 u £ h{a), s t } 

A relation h Q TxP can be seen as a mapping from T into 2^ associating the 
image h{a) of any a £ T. When P = S* for some alphabet S, such a mapping is 
extended by morphism to a substitution from T* into S* i.e. a mapping h from 
T* into 2'^* such that h{e) = {e} and h{uv) = h{u)h{v) for every u,v £ T* . 
The composition of functions is the composition of their relations: {g o h){a) = 
h{g{a)). Taking alphabets 0,P,Q, a Q-graph G, and mappings h C PxQ* and 
g C OxP* such that e ^ g{0), we have 

g-^h-\G)) = {g o h)-\G) (1) 

Another basic property is the commutation between the inverse mapping and 
the restriction to particular sets. A set C is stable in a graph G when any path 
between vertices in G contains only vertices in G : 

So ^ Si . . . Sn—\ ^ Sji A SQj S’a £ G x Si, • ■ • , Sj^_i £ C 

G G 

For any stable set C in a P-graph and any mapping h into 2^, we have 

h-\G\c) = (h-\G))\c ( 2 ) 

Another way to express a restriction of an inverse mapping of a graph is to use 
a marking of the graph. The marking #JyG) on a vertex set C of a graph G by 
a symbol # is the graph: 

#^(G) := GU{sAs|sGG} 

obtained from G by adding # to any vertex in G. Any restriction of an inverse 
of a graph is an inverse of a marking of the graph: 

(ft."^(G))|c = 9 ~^{#^{G)) where g{a) = #h{a)# (3) 

For e the neutral element of the free monoid T* generated by an alphabet T, the 
e-closure G of any T U {ej-graph G is obtained by removing the ^-transitions 
and by adding P-transitions as follows: 

G -.= {Id^)-^{G) = {s^t\a£T ^ s^t} 

We compare graphs by isomorphism. A partial isomorphism from a graph G into 
a graph FI is an injective function such that s t /i(s) h{t). An 

isomorphism is a partial isomorphism such that Vq Q Dom{h) and Vh Q Im{h). 
A partial isomorphism on P U {e}-graphs considers the £ label as a new letter. 
To take an £-transition as an internal (silent) move, we compare P U {£}-graphs 
by partial weak isomorphism. A partial weak isomorphism from a graph G into 
a graph FI is an injective function such that s t h{s) h{t). A 

weak isomorphism is a partial weak isomorphism such that Vq Q Dom{h) and 
Vh Q Im{h). Note that is a (resp. partial) weak isomorphism from G into H 
if and only if i/ is a (resp. partial) isomorphism from G into FI . 
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3 Classes of Graphs 

A general way to define a family RECp of graphs from a family F of languages 
is to take the set of graphs obtained from the binary tree (with inverse transi- 
tions) by applying an inverse A-mapping followed by a rational restriction |S|. 
An equivalent way is to take the set of graphs obtained from the binary tree 
by marking a rational vertex set followed by an inverse A-mapping (Proposi- 
tion Id. I ll . We deduce known results on the family RFC pin (Theorem td.'/ill , on 
the family RFC Rat ^Theorem Id. dll , on the family RFCj^^^ where Lin is a sub- 
family of linear languages f Theorem Id. 511 . We deduce also closure properties by 
inverse mappings. 

Let N be an alphabet containing at least two letters. We take a new alphabet 
A^:={ a I a G } in bijection with N. We define the following Dyck graph: 

Apj- := { u ua \ u € N* A a € N } U { ua u \ u € N* A a € N } 
where a representation for N = {a, 6} is the following: 



e 




When L is rational, we say that is a rational marking of Ajq . Let 

Dyckj^ = [Atv] where [G] is the set of all graphs isomorphic to a graph G, 
that we extend by union to any class of graphs. 

We restrict here a language family T’ to be a subset of 2^^^ . A family of lan- 
guages defines a set of mappings: a mapping h is rational (resp linear, . . . ) if for 
any letter a, the language h{a) is rational (resp linear, . . . ). Precisely, a language 
family F and an alphabet T produce the set Fp of mappings defined for every 
o G T by a language h(a) G F. By inverse of a class ^ of (iV U A^)-graphs, we 
get the following class of T-graphs: 

i^-i(^) := { /i-i(G) I G G A G Ft } 

Starting from Dyckj^ , we have two ways to get classes of graphs. Either we 
apply inverse F-mappings followed by rational restrictions |E| : 

F^\Dyckj^)\ := [{ h~\AN)\L \ h € Ft A L € Rat{N*) }] 
or we apply rational markings followed by inverse F-mappings: 

F^\#{Dyckj^)) := [{ /i-i(#JAjv)) \ h € Ft A F G Rat(N*) }] 
Henceforth F will be one of the following language families: the family Fin of 
finite languages; the family Rat of rational languages; the family Lin of linear 
languages; the family RF of recursively enumerable languages; the subfamily 
Lin of linear languages generated by linear grammars such that each right hand 
side is e or of the form uBv where F is a non-terminal with u G N and v G N*; 
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and the rational closure Lin(Rat) of Lin . For these families, the two previous 
classes of graphs coincide and we can also restrict N to have only two letters. 

Proposition 3.1. For any distinct letters a,b G N , we have 

F~\Dyck^)l = F~\Dyck^^i^-^)\ = F-\#{Dyck^^f^-^)) = F~\#{Dyck^)) 

This class is denoted RECp^ or RECp when T is understood. 

The class RECpin is the set of regular graphs (see 0) of bounded degree, 
and we present again two sets of representatives. 

Theorem 3.2. m Given an alphabet N of at least two letters, the following 
properties are equivalent: 

a.) G G RECpin.^ 

b) G is isomorphic to {F[.N*)\]^ for some finite F['GN*xTxN* and 

L G Rat{N*) 

c) G is isomorphic to Ur=i(^* Vi).Wi for some n>0, Oi, ... ,a„ G T, 

Ui,vi,... ,Un,Vn G N* , Wi,... , £ Rat{N*) 

d) G is a regular T-graph of bounded degree. 

The traces of the graphs in RECpin are all the context-free languages. 

Recall that a graph G C N*xTxN* is recognizable if G = Ui Vi U 
. . . U Un Vn for some n > 0, ai, . . . , a„ £ T,Ui,V\, . . . , G„, £ Rat{N*). 

The class REG Rat has been studied in 0 and we present again two sets of 
representatives. 

Theorem 3.3. ISI Given an alphabet N of at least two letters, the following 
properties are equivalent: 

a) G £ RECfiat.^ 

b) G is isomorphic to (F[.N*)\t, for some recoqnizable F[ C N*xTxN* 

and LGRat{N*) 

c) G is isomorphic to [X=iiUi Vi).Wi for some n > 0, oi, . . . , a„ £ T, 

C/i, Ri, VFi . . . , t/„, V„, Wn G Rat{N*). 

The traces of the graphs in REG Rat all the context-free languages. 

Several characterizations of REGpin inside REG Rat have been given in [2j and 
m- By Proposition EH we get closure properties of these two families. 

Proposition 3.4. We have Finj}{REGRin^) = REGpia^ 
and Ratf}{REGFin^ = Ratf} {REG Rat^ = REGRat^ 

It remains to recall the family of rational graphs m We consider a graph as a 
subset of N*xTxN* i.e. a T-graph with vertices in N*. We extend the monoid 
N*xN* to the partial semigroup N*xTxN* defined by 
{u,a,v).{x,a,y) = {ux,a,vy) 
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for every rt, v,x,y G N* and a G T. The extension by union of . to subsets is 
the usual synchronization product for graphs 

G.H = { ux vy I u V A X y } for any G,H C N*xTxN* 

G H 

To this operation is associated the rational family Rat{N* xTxN*) of graphs: it 
is the smallest subset of 2^ xTxn containing the finite graphs and closed by 
U , . , + . A rational graph is a graph isomorphic to a graph in Rat(N* xTxN *) ; 
we denote by RAT^ the family of rational T-graphs. The rational graphs are the 
graphs recognized by the labelled transducers. Precisely, a T-labelled transducer 
is a finite (A^*xA^*)-automaton A = {G,i, {Fa)a^T) with a set Fa of final states 
for each a G T; such an automaton recognizes the graph: 

L(A) := { u AF). | 3 s G i s } 

The family Lin defines by inverse mappings the class of rational graphs. 

Theorem 3.5. {HI fPe have RAT^ = REGj^^^ C 

The traces of the graphs in RAT are context-sensitive languages. 

A conjecture is that the traces of the graphs in RAT are exactly the context- 
sensitive languages. Note that we can have non recursive traces for graphs in 
REGun ■ From the closure by composition of rational relations, the rational 
graphs are closed by inverse finite mappings. 

Proposition 3.6. We have Finj}{RAT^) = RAT^. 

We will now use Turing machines to define a general class of graphs whose the 
traces are the recursively enumerable languages. 



4 Graphs of Rewriting Systems and of Turing Machines 

We consider the rational restrictions of the ^-closure for the set of transitions of 
the labelled Turing machines. We show that this family is the same that for the 
labelled word rewriting systems (Theorem kl.4ll . We show also that this family 
is REGf for any family F of recursively enumerable languages containing the 
rational closure of the linear languages iTheorem 14.511 . Finally, we show that 
this family is the set of the inverse rational mappings of the rational graphs 
(Theorem 14.61 1 . 

The notion of a word-rewriting system is well-known (see for instance the survey 
and ^): it is just a finite set of rules between words. As for the transitions of 
a pushdown automaton, we allow labelled rules, and to any system, we associate 
a rational language of admissible words, usually called configurations, which are 
the words where the rules can be applied. The words are over an alphabet (finite 
set of symbols) N of non-terminals, and the rules are labelled by symbols in an 
alphabet T of terminals, plus the empty word e. 
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Definition 4.1. A finite labelled word rewriting system {R,C) is a couple of 
a finite relation R C N* x(T U {e})xN* and a rational language C Q N* of 
configurations. We write shortly R instead of {R,N*). 

The set of transitions of R is the following (T U {£})-graph: 

T{R) := { xuy xvy \ {u,a,v) & R A x,y & N* } 

The unlabelled transitions of T{R) form the usual rewriting — > of R : 

xuy — > xvy for some (tt, a,v) € R with x,y € N* 

R 

Its reflexive and transitive closure — >■* by composition is the derivation of R. 

R 

To any system (R,C), we associate its transition graph: 

G{R, C) := T(i?)|(^ = { u V \ u v A u,v € C A a € T } 

which is the restriction to C of the e-closure of T{R). In particular G{R) = T{R). 
For instance, the transition relation of a pushdown automaton over a set Q of 
states and over a disjoint set P of stack letters, can be seen as a rewriting system 
(R,G) over N = PUQ where R is in Q.Px(T U {e})xQ.P* , and C is a rational 
subset of Q.P* ■ The closure by isomorphism [G{R, C)] of their transition graphs 
form the family REG Rat ■ 

Another particular rewriting systems are the Turing machines with a read only 
input tape and a working tape [El, |E|. More exactly and given a set Q of 
states, a disjoint set T of input tape letters, and a disjoint set Pq = P U {□} of 
working tape letters, a Turing machine (M, G) is a finite set M of rules of the 
form: 

pA qBS where p,q € Q, a G T U {e}. A, P G Pq , <5 G {+,-} 
with a rational set G of configurations G G Rat{{Q U Pd)*). 

However we are only interested to configurations upv where p G Q and u,v G Pn* 
with u(l), u(|u|) yf □ . Precisely a configuration is of the form ]u]p [u[ where for 
any word u G Pn*, [u[ (resp. ]u]) is the greatest prefix (resp. suffix) of u having 
its last (resp. first) letter distinct of □ i.e. by induction, 

[«□[= [u[ A [u[= u if udMl) yf □ and ]□«]=]«] A ]u] = u if u(l) yf □ 
The set of transitions of M is the following (T U {£:})-graph: 

T{M) := { ]u] p [Av[ ]uB] q [u[ | pA AP). qB+ A u,v G Pa* } 

M 

U {]uG]p[Av[ ]u]q[GBv[\ pA qB- A G G Pa A u,v G Pn* } 

M 

Hence the transition graph of any Turing machine (M, G) is the T-graph: 

G(M,C) := TW)|c 

The transition graph of any Turing machine is the transition graph of a stable 
rewriting system (P, C) meaning that G is stable in T{R) : 
s — >•* r — >■* t A s,t G C => r G G 

R R 

Lemma 4.2. We can transform any Turing machine M into a stable rewriting 
system (R,G) such that T(R)\c is isomorphic to T{M). 

Proof. We take a new symbol $ and the following rational language: 

C = { $upu$ \ pGQ a u,v G Pn* A u(l),w(|u|) A □ } 

We transform any rule pA qB+ of M into the following rules: 
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CpA CBq if CB / $□ 

Cp% CBq% if A = D A CB / $□ 

$pA — ^ $g if B = □ 

$p$ ^ $g$ if yl = B = □ 

We transform any rule pA — ^ qB- of M into the following rules: 

CpAD qCBD if C A BB / □$ 

CpA% qC% if B = □ A C □, $ 

DpA$ — ^ q$ if B = □ 

Cp$ qCB$ if A=D/B A C/$ 

Cp$-^qC$ if A = B = □ A C □, $ 

□j5$ — ^ q$ if A = B = □ 

$pAD $qOBD if BD / □$ 

$p^$ — ^ $g$ if B = □ 

$j3$ ^ $<}DB$ if A = □ / B 

$j3$ — ^ $g$ if A = B = □ 

In this way, we obtain a system R such that C is closed by — > and 
U^V <=^ $U$ $V$ A $U$,$V$€C 

T(M) T(R) 

Thus (B, C) is stable and $T(M)$ = T(B)|c . 

□ 

Conversely and up to the ^-transitions, any rewriting system can be simulated 
by a Turing machine. 

Lemma 4.3. We can transform any rewriting system R into a Turing machine 
(M,C) such that T{M)^(y is isomorphic to T{R). 

Proof. We denote by mi (resp. m 2 ) the maximum length of the left (resp. right) hand 

sides of the rules of R i.e. 

mi = max{ |f/| | 3 a, C, {U, a,V) £ B } 
and m 2 = max{ \V\ \ 3 U, a, {U, a,V) £ B } 

We take two new symbols . and $, and we define the following state set Q of the Turing 
machine to be constructed: 

Q = {.} U U{$})x(Tu{e}) 

We take the following set M' of Turing rules: 

,A — ^ tA+ for A £ Nn 

,A ,A- for A G Nn 

.4 ^ (B, V, a)A+ for A € Na and (B, a,V) (3 R 

{AU, BV, a) A (B, V, a)B+ 

(e,BV,a)A ^ (e,VA,a)B+ for AgN 
(e,BV,a)a ^ (e,V,a)B+ 

(AU, e, a) A ^ (UB, e, a)B+ for B £ N 
(AU,e,a)A ^ (B,$,a)D-b 
(AU,$,a)A ^ (B,$,a)D-b 
(e, $,a)Q — ^ (e,e,a)D-|- 
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In this way, we obtain a Taring machine M' such that for every a G T U {e} and for 
every U,V eN*, 

U-^V <t=^> .U ^ X(e,e,a)Y A [XYl=V 

T(R) T{M') 

We complete M' to M by adding the following rules: 

I (e, e, a) A ,A+ for A G Nn 

The relation h = { {U, •U) \ U G N* } is a partial weak isomorphism from T{R) into 
T(M). More precisely and for every a G T U {e}, we have 

U V .U ^ .V and .U ^ .V U ^ V 

T(R) T(M) T(M) T(R) 

So h is a partial isomorphism from T{R) into T{M) . Thus /i is a partial isomorphism 
from T{R) into where C = Im(h) — ,N* . 

As C N* = Dom{h), the graphs T{R) and are isomorphic. 

□ 

The rewriting systems and the Turing machines have the same transition graphs. 



Theorem 4.4. The Turing machines and the rewriting systems define up to 
isomorphism, the same family of transition graphs, and the traces are the recur- 
sively enumerable languages. 

Proof, i) Let (M, D) be a Turing machine. 

By Lemma 14.21 . we can construct a stable rewriting system (i?, C) and an iso- 
morphism h from T{M) to T{R)\c ■ 

By restriction, h defines an isomorphism from T{M) to T{R)\q . 

By Equation Q, T{R)\c = T^\c = G{R,C). Thus G{M,D) = T(M)p is 
isomorphic (by a restriction of h) to 7’(.R)|cn/i(£)) = G{R,CC\ h{D)). 

ii) Let {R, C) be a rewriting system. 

By Tjemma, l4.dl . we can construct a Turing machine (M, D) and an isomorphism 
h from T{R) to T(M)|^ . Thus G{R, G) = T{R)^^ is isomorphic (by a restriction 

of h) to iW|MC)nD = G{M, h{C) n D). 

We denote by TURINGt the family of T-graphs isomorphic to the transi- 
tion graphs of Turing machines (or of rewriting systems). As for the previous 
graph families (investigated in the previous section), we characterize the family 
TURINGt by inverse mappings of the binary tree. The images of these map- 
pings can be the class of recursively enumerable languages, or can be only the 
class of the rational closure Lin{Rat) of Lin. 

Theorem 4.5. We have TURING^ = = RECre^ 

Proof, i) Let us show that TURING^ C . 

Let {R, G) be a rewriting system: i? is a finite subset of N*x{T U {e})xA^* and 
C is a rational subset of N* . 
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We replace in R the label e by a new letter $ : 

S := { {u, a, v) G R \ a G T } U { (m, $, r>) | {u, e,v) G R } 

So T(S) = where h is the following linear mapping: 

h{a) = { xuv X I (m, a, ?;) S S' A x G N* } for every a G T U {$} 
Furthermore T{R) = g~^(T{S)) where g is the following rational mapping: 
g{a) = $*a$* for every a G T 

By O, we have T{R) = {g o h) where g o h is the following mapping: 

{g o h){a) = h{g{a)) = h(%*a%*) = h{$)*h{a)h(%)* G Lin(Rat) 
Finally and bv ProDOsition tl.il G(R.C) = G . 

ii) C RECfiE because Lin(Rat) C RE. 

iii) Let us show that RECre C TURING . 

T _ 

Let a mapping h : T — > i?if({o, 6, a, 6}*). By Fronosition Id. 1 1 , it is sufficient 
to construct a rewriting system (R,C) such that is isomorphic to 

T{R)^c- 

For every c G T, there is a Turing machine Me : a finite set of rules of the form: 
pA AE). qBS where p,q G Qc, x G {£,a,b,a,b} , A,B G PcA {O} , S G {+,-} 
plus an initial configuration and a set Fc C of final states recognizing: 
h{c) = L{T{Mc),ic,{M<lM\ Q ^ Ec A m, r; S (Pc U □)* }) 

Up to renaming, we may assume that the sets (Pc)ceT, (Qc)cgt are pairwise 
disjoints, and we define the following Turing machine: 

M = { pA ^ qB6 G Me \cGT} 

U { pA -E). q^B5 | 3 c G T, pA qB5 G Me f\ x ^ e } 

We take three new symbols $, &, . and we construct a rewriting system R. First, 
we take the following rules: 

$$ -Ef $j^$ for every cGT 



to describe the moves between two $ of the Turing machines defining h. We 
transform (as in Lemma^2l any rule pA — > qB+ of M into the following rules: 
CpAE^CBq if CB^%U 
Cp% CBq% if A = □ A CP yf $□ 

$pA -Ef $q if B = n 

$p$ -Ef. %q^ if = B = n 

In a same way (and as in the proof of Lemma H.2I1 . we transform any rule 

pA — >• qB- of M into new rules of R. 

For every c G T , q G Qe , A G PeGI {□} , y G {a,b,a,b} , x G {a,b}, we take 
the rules: 



qy ■ 




Aq'y - 


-^q'yA 


x$(?L. 


E^$q 


Wx ■ 


-E^ x$q 


qA- 


E^Aq 


qSz - 


6 

— ^ q 



For the acceptance and for every c G T and A G (|J^ Pc) U {□} , we define 
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q . if q G Fc 

.A ^ . 




For every c G T and u,v G {a, b}*, we have 

u V u%$ =% w$$ 

So h = { (w, M$$) I u G {a, b}* } is a partial weak isomorphism from ^j) 

into T{R). Thus h is a, partial isomorphism from ^j) = h~^{A^^ ^j) 

into T{R). Hence h is an isomorphism from h~^{A^^ into T{R)^q where 

C = Im{h) = {a,b}*n. 

□ 

The class TURING is the closure of RAT by inverse rational mapping. 
Theorem 4.6. We have RaT^^(RATM) = TURINGt- 
Proof, i) TURINGt C Ratf^{RATN). 

Let G G TURINGt - G is isomorphic to T(i?)|p for some rewriting system 
(R,G). Let #,$ be two new symbols. We have 
T{R) = h~\T{S)) 

where h is the rational mapping defined by h(a) = $*a$* for every a G T, 
and S is the system obtained from R by replacing the label e by $ : 

S := { {u, a, v) G R \ a G T } U { (tt, $, u) | (u, £,v) G R } 

By Equations @ and we have 

T(i?)|c. = (t{S) U {m ^ u I u e C}) 

where h^{a) = #$*a$*# for every a G T. 

Obviously T{S) is a rational graph and {u u \ u G G} is also a rational 
graph because C is a rational language. Hence G G Ratf}{RATjs[). 

ii) Raff}{RATN) C TURINGt- 

Let G G Ratf}{RATfq ) : there is a rational Wgraph H and a mapping h from 
T into Rat{N*) such that G = h~^{H). 

By definition of a rational graph, there is an alphabet X and a N-labelled trans- 
ducer A = where K is a finite (X*xX*)-automaton, i is the 

initial state, and for each x G N, Ex is a set of final states, and such that the 

automaton A recognizes the graph L(A) = { u v \ 3 s G E^ , i s } 
isomorphic to H . 

Furthermore and for each a GT, there is a finite Wautomaton {Ka,ia, Fa) rec- 
ognizing the rational language h(a). 

We may assume that the automata {Ka)^^,j, have pairwise disjoint state sets: 
Vxa O Vk^ = 0 for a yf 6. We denote by iF = IJaer and we take a new state 
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We take a new symbol $ and we denote by C = $iX*$ the (rational) configura- 
tion set of the following rewriting system R : 

i ia for each a G T 

$s $(i, s) for each s G 

$s $1 if s G Fa 



R 



{p, s)u v{q, s) if p q and s G 

{p, s)$ t$ if p G Ex and t for some x G N 

K 

As sA for each A G X and s G 



Thus 



= { $m$ 

= { $m$ 
= { $m$ 
= { $m$ 



$«u$ I $zu$ $«u$ I 

T(K) 

$«u$ I 3 s G TIj $iaU$ $su$ } 

T(K) 



$iv$ 

$JU$ 



w 



G h(a) 



w(l) 



L(A) 



L(A) 



Ha) 1 

U => V } 
HA) 



□ 



= $ih ^(L(A))$ isomorphic to h ^{H) = G. 



In particular RAT is not closed by inverse rational mapping. We also deduce 
that the transition graphs of Turing machines are the rational restrictions of the 
e-closure of rational graphs (with e-arcs). 



5 Conclusion 

We have presented a hierarchy of graph families and essentially the family 
TURING of transition graphs of labelled Turing machines with e-rules. In par- 
ticular REGuat is the family of transition graphs of pushdown automata with 
e-rules. Between the lowest family FIN of finite graphs and the greatest family 
TURING , we can show that the two families REG Rat and RAT are natural, by 
considering the Cayley graphs of the word rewriting systems 0, 0 (and a work 
in preparation with Knapik). Finally and using traces, the hierarchy FIN , 
REGRat , RAT , TURING yields a Chomsky hierarchy, except that it remains 
to see whether any context-sensitive language is the trace of a rational graph. 
Another point is to find a subclass of word rewriting systems such that their 
transition graphs are the graphs of REG Lin . 
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Abstract. In this paper we introduce and study a graphical representa- 
tion of the join calculus given by the so called jc-nets. jc-nets provide a 
fully abstract semantics for a Turing complete fragment of the join calcu- 
lus. Thus a new formal model for concurrency and distributed systems 
is introduced. This new formal model simplifies Milner’s vr-nets, while 
preserving their expressive power. 



1 Introduction 

New communication technologies are changing the landscape of computing. The 
Internet has become a platform for large scale distributed programming. Turing’s 
model of computation presents programs as computing functions or relations 
from inputs to outputs. Nowadays we work with new concepts like multitasking, 
distributed systems, mobile agents . . . that have appeared since we deal with 
global computation instead of isolated systems with few interactions with their 
environment. The key feature of this new computation paradigm is that infor- 
mation is flowing around and programs interact each other. Complex behaviours 
arise as the global effect of a system of interacting processes. We need new con- 
ceptual tools, new models and theories to describe and understand, to design 
and build up these new systems. 

Currently there are several languages for programming distributed systems. 
Many of these languages lack a complete formal definition and, in consequence, 
they lack rigorous techniques to reason about program equivalence and program 
properties. We refer in this paper to the Join Calculus Language together with 
its core part called the join calculus. The join calculus could be considered as 
a version of the 7r-calculus and it preserves its expressive power. The 7r-calculus 
fMilfl.JIMilDQj is a term algebra able to express mobility. Interaction in the tt- 
calculus is given by shared names used as I/O channels. The 7r-calculus works 
with channels, guards and messages. In the graphical presentation of the tt- 
terms given by the 7r-nets channels are represented as complex nodes 

called torpedos, guards are represented as boxes and messages are represented 
as directed arcs. In this graphical representation, the boxes hide the internal 
nodes which represent channels. To make available the hidden channels, Mil- 
ner uses a rather complex auxiliary mechanism of links. In order to avoid this 
auxiliary mechanism (that has no corresponding elements in the 7r-calculus), we 
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designed a new mapping such that channels are represented by simple nodes, 
messages are represented by boxes of arcs and guards are represented by arcs 
between boxes. This new mapping simplifies the graphical representation of the 
TT-calculus. Unfortunately it leads to identical representations for processes with 
different behaviours. Interestingly enough, this mapping works well when we deal 
with the join calculus. Using this mapping we introduce and study a graphical 
formalism called jc-nets. We use these jc-nets to interpret a subset of the join 
calculus. Therefore we present a new formal model for concurrent and distributed 
systems that comes with an important structural reduction of the 7r-nets while 
preserving their expressive power. The definition of the jc-nets is given by the 
same algebraic framework used for 7r-nets. 

2 The Join Calculus 

In this section we recall the definition of the join calculus. The join calculus was 
introduced by Fournet and Gonthier in IFGQfij as a core calculus to underlie a 
concurrent programming language. A detailed account may be found in ^nnni. 
Our presentation is based on First we consider a restriction of the join 

calculus. Let X be an infinite countable set of names. The symbols u,v,x,... 
range over names, and u,v,x, . . . range over finite sequences of names. We denote 
by V the set of the join calculus terms which are called processes. The symbols 
P,Q,R, . . . range over processes. 

Definition 1. Processes are given by the following grammar: 

P ::= 0 empty process 



0 is the empty process. An output message u{v) denotes the emission of a name 
V along a channel u. P \ Q represents the parallel composition of P and Q. 
Interprocess communication is achieved via definitions. A definition def u{v) > 
P ±n Q results in a receptor channel u being created for the interaction between 
processes P and Q. Process Q could eventually send a channel name y along 
channel u. At the other end of channel u, process P is waiting for the reception 
of channel names. When the name y, sent by the process Q, is received from 
channel u, process Q will continue its execution in parallel with process {y/v}P 
and the receptor channel u is open to receive new names. 

In the above syntax, only the definition binds names. Thus u and v are bound, 
and the scope of v is P, whereas the scope of u extends to the whole definition, 
a-conversion is defined in the standard way. We write {y/x}P to denote the 
usual capture-avoiding substitution of the free occurrences of name x by name y 
in the term P. Formally, the free names of the processes are defined inductively 



u{v) 

P\Q 

def u{v) > P in Q 



output message 
parallel composition 
definition 



by: 



fn(0) = 0 fn{u{v)) = {u, v} fn(P | Q) = fn(P) U fn(Q) 
fn(def u{v) > P in Q) = (fn(Q) U (fn(P) — {u})) — {u} 
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Definition 2 . Structural congruence = ^ V x V is the smallest eongruence 
relation which satisfies the following axioms: 

Al: def u{v) > P ±n Q = def u{t) t> {t/v}P in Q, if t ^ 

A 2 : def u{v) > P ±n Q = def w{v) > {w/u}P in {w/u}Q 
if V ^ {u, w} and w ^ f n(P) U fn(Q) 

PI: P |0 = P P 2 : P|g = g|P P 3 : (P|g) |P = P| (g|p) 

Dl: Qi I def u{v) > P in g2 = def u{v) > P in (Qi \ Q2) if u ^ 

D 2 : def u{v)>Pi in def w{t)>P2 in g = def w{t)t>P2 in def u{v)>Pi in Q 
ifu^w, u ^ fn(P2), and w ^ fn(Pi) 

Definition 3 . Reduction — >-C V x V is the smallest relation which satisfies: 

R 1 def ui{vi)t>Qi in def U2{v2)>Q2 in . . . def Un{vn)>Qn in (P | Ui{v)) — >■ 
def ui{vi)>Qi in def U2{v2)>Q2 in . . . def Un{vn)>Qn in (P | {v/vi}Qi) 
if {wi+i, ■ • ■ , Un} n (fn(gi) U {rti}) = 0 where i € [n] and n > 1 . 

R 2 Pi — >■ P2 implies def u{v) > g in Pi — ^ def u{v) >Q in P2 
R 3 Pi = gi, gi — >■ Q2, and Q2 = P2 implies Pi — >■ P2. 

We should remark that we have no reduction rule for parallel composition. Such 
a rule is just a consequence. It is proved that if Pi — >■ P2, then g | Pi — g | P2. 
Moreover, for any substitution cr = {y/x}, Pi — ^ P2 implies aPi — >■ (TP2. 

If in the definition of processes we consider messages of the form u{v) and def- 
initions of the form def u{v) > P ±n Q, we get a polyadic version of the join 
calculus. Fournet shows in IMDl that this polyadic version of the join calculus 
can encode the call- by-name A-calculus. Consequently this polyadic version is 
Turing complete. The definitions and results presented in this paper can be ex- 
tended to the polyadic version of the join calculus in the same way the monadic 
TT-nets are extended to the polyadic 7 r-nets [mm . In the full version of the join 
calculus the processes are defined by using two new syntactic categories, namely 
definitions and join-patterns. 



As above, the scopes are determined by terms of the form def D in. P that define 
new channels for the body P where they are used. The set of defined channels 
in D is denoted by dn(P). A join-pattern J binds a set rn( J) of received values. 
We also extend the set fn(P) of the free names. 



P ::= 0 



I u{v) 

I P\P 

I def D in P 



D ::= J>P 
I D^D 



J ;:= u{v) 

I J\J 



rn{x{v)) = {z;} 

rn(J| J') = rn( J) U rn( J') 



dn(a:(h)) = {x} 

dn( J I J') = dn( J) U dn( J') 



fn( J > P) = dn( J) U (fn(P) — rn( J)) 
fn(P A D') = fn(P) U fn(P') 



dn( J t> P) — dn( J) 

dn(P A D') = dn(T)) U dn(P') 



fn( 0 ) = 0 fn(a:({i)) = {a:} U {ii} fn(P | P') = fn(P) U fn(P') 

fn(def D in P) = (fn(P) U fn(P)) — dn{D) 
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When the sequences of names v have the length 1, then we get the monadic 
join calculus. We don’t present here the extension of structural congruence and 
reduction to full join calculus - see IBHI for this extension. 

3 The Control Structure of the jc-Nets 

3.1 Control Structures 

We present the jc-nets as a control structure. First we review the control struc- 
tures. Most of the ideas on the control structures we present in this subsection 
can be found in the paper |MMF95| . A detailed account of the control structures 
may be found in |Mif9ti| . 

From an algebraic point of view, a control structure consists in a set of terms 
together with an equational theory and a reduction relation upon terms called 
reaction. A control structure can be seen as a symmetric strict monoidal cate- 
gory (ssmc for short) with additional structure. The ssmc morphisms denoted 
by a,b,c, . . . correspond to the terms of the control structure (in the algebraic 
setting). They are called actions. The ssmc objects denoted by m,n,k,... are 
called arities. The monoid of the arities (M, e) is assumed to be freely gener- 
ated by a set P. The elements of P, denoted by p, 9 , . . . are called prime arities. If 
the arities m and n are the domain and respectively the codomain of o, we write 
a : m ^ n; with some abuse of terminology, we say that a has the arity m ^ n. 
We write a • 6 : m — ?> fc for the ssmc composition of a : m — >■ n and b : n ^ k, and 
a®b\ m®k^n®l for the ssmc tensorial product of a : m — >■ n and b : k ^ 1. 
By idm : m — >■ TO we denote the ssmc identities, and by „ \ m®n ^ n® m 
the ssmc symmetries. 

A control structure uses a denumerable set X of names denoted hy x,y, z, . . .. 
To define the terms, we need a signature (P, /C), where P is the set of prime 
arities, and /C is a set of control operators. Each name x G X must be equipped 
with a prime p G P, written x : p. A control is used to construct complex terms. 
Each control K G K. must be equipped with an arity rule of the form 

Oi : TOi — >■ TT-I ... Or : ITlr — >■ TO , . 

K{a \, . . . , Or) '. m ^ n 

where x may constrain the value of the integer r and the arities mi,ni,m,n. 
When fixed, r is called the rank of K. To define the reaction, we need a set R 
of reaction rules. A reaction rule is an ordered pair of terms having the same 
arity. The equational theory is common to all the control structures. Besides the 
ssmc and control operators, every control structure contains a datum operator 
(x) : e — >■ p (where x : p), a, diseard operator Wp : p — >■ e, and an abstraetor 
operator ab^: a :p®m^p®n (where x : p and a : m ^ n). Sometimes we 
will omit the arity subscripts. We suppose the terms used in our definitions and 
results are well-formed, and all the equations are between terms with the same 
arity. 

The action terms are constructed by the following grammar: 

a ::= {x) | ujp \ idm | Pm n \ | | aba, a | K{a\...ar) 
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The following derived operator is a new kind of abstractor 

def 

{x)a = aba; a ■ {u>p 0 id„) {x : p, a : m ^ n) 

{x)a (xi) . . . (a:„)a [x = xi . . . x„) 

The equational theory of a control structure is the congruence upon the terms 
generated by the ssmc axioms: 



a ■ id„ = a 
idm ■ a — a 
{a ■ b) ■ c = a ■ {b ■ c) 

a 0 ide — a = ide (g) a 

idm 0 idai = idrTKgjaa 



Pm,n " Pn,m idTT^^Ti 
Pfc®m,n = (idfe ® Pm,n) ' (Pfc,„ ® 
Pm,n ■{b0a) = (a ® &) • Pfe,i 
{a 0 b) 0 c = a 0 {b 0 c) 

{a ■ b) 0 {c ■ d) = {a 0 c) ■ {b 0 d) 



together with the following axioms (where x and y are different in aba, {y) 



idp0{y)): 



abx (x) ■ {ujp 0 idp) = idp 
aba (y) = idp 0 (y) 

a.bx (-‘dq idp LOq 
a.bx id/n idp07n. 



Pm,n — ® Pm,n 

aba (a • 6) = aba a ■ aba b 

aba (a (g) idm) = aba a 0 id™ 

aba aba a — idp 0 aba a 



aba abp a ■ (p^ 0 id„) = (p^^^ 0 id„) • abp aba a {x : p,y : q,y ^ x) 

{{x) 0 idm) • {x)a = a {x : p, a : m ^ n) 

(x) ■ {y){{y) 0 {y}) = (x) 0 (x) (x,y:p) 

{(x) 0 id„) • {y)K{ai,.. .) = K{{(x) 0 id„J • {y)ai , . . .) {x,y : p) 



We use the equality = between two actions a and b if the equation a = b 
can be proved using the above axioms together with the rules of a congruence. 
Otherwise, we write a ^ b. 



Each action a possesses a surface defined by surf (a) = {x G X \ ab^ a yf id(g)a}. 

We introduce a second derived operator by [x/y]a = ((x) (g) id^) • (y)a. Two 
technical results motivate the substitution-like notation chosen for this derived 
operator. The first result shows a kind of a-conversion, and the second result 
shows properties verified by any standard capture-avoiding substitution (with 
the surface understood as the set of the free names). Note that, using this new 
derived operator, the last tree axioms of the equational theory become well- 
known substitution properties. 



The reaction \ is defined as the smallest relation upon the actions which sat- 
isfies the reaction rules R, and is closed under composition, tensor, abstraction, 
and equality. 



3.2 Hypergraphs 

In this section we recall the definition of the hypergraphs together with some 
standard related concepts such as isomorphism, contraction on nodes and edges. 
A rooted hypergraph is a tuple H = (S', V, E, s) where S is a set of hyperedges, 
E is a set of vertices, if C S x E is an incidence relation, and s G S is the 
root hyperedge. The components of a hypergraph H are denoted by Sh, Vh, 
Eh, and sh- For a hyperedge t G Sh and a vertex v G Vh, if (t,v) G Eh 
then we say that “v lies on t” . The graphical representation of a hypergraph 
H is as follows. Hyperedges t G Sh are represented as unfilled ovals with the 
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name t outside. Vertices v G Vh are represented as points carrying the name v. 
Incidences {t, v) G Ejj are represented as tentacles from the oval t to the point 
v\ the length of such a tentacle is often taken to be zero. Finally, the root is 
emphasized by an arrow pointing to the oval sh- 

Example: Fig. Q] shows the graphical representation of a hypergraph H. Here 
Sh = Vh = {v,w,w'}, Eh = {{s,v),{t,v),{t,w),{t' ,w')}, and 

Sh = {t}- The left picture represents El using only tentacles of nonzero length, 
and the right one only tentacles of zero length. Often we use the representations 
of the middle picture as a compromise between size and clarity of visualization. 




Fig. 1. Example of a hypergraph. 



Let iJ be a rooted hypergraph. A nonempty subset of nodes W VVh gives rise 
to a contraction on vertices H/W which is the rooted hypergraph defined by 
Snjw = Sh 

Vh/w = {Vh \ if) U {u}, for a fresh v ^ Vh 

Eh/w = {EH\SHxW)VJ{{t,v) I {t}xIFn£;^^0} 

sh/w = Sh 

A nonempty subset of hyperedges T C Sh gives rise to a contraction on 
hyperedges H/T which is the rooted hypergraph defined by 
Sh/t = {Sh \ T) \J {t}, for a fresh t ^ Sh 
Vh/t = Vh 

Eh/t = {Eh\TxVh)U{ {t,v) \ Tx{v}nEH^^} 
sh/t = if Sh G T then t else sh 

If we consider a hypergraph H, two vertices v,w G Vh, and two hyperedges 
s,t S Sh, then we write for the contraction on vertices H/{v,w}, and 

Hs=t for the contraction on hyperedges H/{s,t}. 

Example: Considering H the hypergraph used in the previous example. Fig. El 
shows a contraction Hu,=w' on vertices and a contraction Ht^t' on hyperedges. 




Fig. 2. Example of contractions on vertices and hyperedges. 



Two hypergraphs H and H' are isomorphic if there exist two bijective functions 
4>S '■ Sh — t Sh' and (j)v ■ Vh — t Vh’ satisfying (psisn) = sh' together with 
(s, v) G Eh if and only if (0s(s), 4>v{v)) G Eh' for all s G Sh and v G Vh- 
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The relation of isomorphism is an equivalence over hypergraphs. For the present 
purpose, the names of hyperedges and vertices have no significance and we do 
not distinguish between isomorphic hypergraphs. We work with the equivalence 
classes of hypergraphs and we write H = H' whenever H and H' are isomorphic. 
For the equivalence classes of hypergraphs we use the same graphical represen- 
tation as for hypergraphs, removing the names of vertices and hyperedges. 

3.3 The jc-Nets 

In this subsection we define the control structure of j c-nets and provide a graph- 
ical representation for these nets. The arity monoid of this control structure is 
considered to be the additive monoid (N, -I-, 0) of the natural numbers. The sym- 
bols m,n,k, . . . range over natural numbers, [n] denotes the first n naturals, i.e. 
[n] = {1, 2, . . . , n}. Given a function / : [n] — 1 F and a natural k, we denote by 
k (B f '■ {k + I, . . . k + n} ^ Y the function defined by {k © /)(i) = /(i — k). The 
control structure is defined over the set X = {zi \ i £ N} of the join calculus 
names. The symbols x,y,u,... range over arbitrary names. The unique prime 
arity 1 is associated with each name x £ X . 

The actions of the control structure are enriched hypergraphs. An action 
a = {H, E) of arity m ^ n consists of a hypergraph H together with a decoration 
E = (l,0,A,r, /i) of H consisting in an injective function I : [m] -£ Vh, a 
function 0 : [n] — 1 Vh, an injective function A : F — >■ Vh with Z C A, a relation 
T C Vh xVh, and a function fj, : Sh — l The functions y can be viewed 

as multisets over Sh x VA x V//. We use multiset operations l±l, — , . . . over these 
functions. We use the notation {x,y,y} for an arbitrary multiset y over the set 
{x, y, z} with y{x) = 1, y{y) = 2 and y{z) = 0. 

The concepts of isomorphism and contraction introduced for hypergraphs 
extend in a quite straightforward way to nets. Let = (Hi,Ei) with Ei = 
Xi,Ti, where i £ [2], We say that ai and 02 are isomorphic if there 
exists a hypergraph isomorphism {(f>s,4'v) between Hi and H 2 such that (pv ° 
Ii = I 2 , <Pv o 0i = D 2 , <Pv o Ai = A 2 , together with (v,v') £ ti if and only 
if (pv(y),<Pv(v')) G t -2 and yi(s,v,v') = y 2 ((ps(s), (pv(v), <Pv(y')) for all s G 
Shi and v, v' G Vh, ■ Just as for hypergraphs, we do not distinguish between 
isomorphic nets. 

We now explain the graphical representations of nets. Let a = {H, E) be a 
net with E = (l, 0, A, r, /i). First we represent the hypergraph H as explained in 
the previous subsection. We suppose that the tentacles in H are of length zero. 
If l(z) = V, 0(fc) = v' and A(a:) = w, then we assign an input label (i) to the 
vertex v, an output label (k) to the vertex v', and a name label x to the vertex 
w. If {v,v') £ T, then we draw an arc outside any oval from the vertex v to the 
vertex v'. It is easy to show that for j c-nets, if y{s,v,v') > 0, then v and v' 
lie on the same hyperarc s, i.e. (s,v), (s,v') £ Eh- Thus, if y{s,v,v') = fc > 0, 
we can draw k arcs inside the oval s from the vertex v to the vertex v' . Similar 
to hypergraphs, for the isomorphism classes of nets we use the same graphical 
representation as for nets; the only thing is to eliminate the names of vertices 
and of hyperedges. 
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We introduce now the control structure operators of the j c-nets. 
The datum (x)'^ = {H, if) : 0 — >■ 1 is defined by 
H = ({s},{z;},{(s,u)},s) 

if = (0,{1 u},{a; u},0,0) - 

The discard = {H, if) : 1 — >■ 0 is defined by 
H = ({s},{u},{(s,u)},s) 
if = ({l^i;},0,0,0,0) 

The three controls which generate the j c-nets are: 

- i/'f = (77, if) : 0 — 1 is defined by 
77= ({s},{u},{(s,u)},s) 
if = (0,{1^^;}, 0,0,0) 



Of, 

o» 

CO. 



- out'*' = (77, if) : 2 — )> 0 is defined by 

77 = ({s}, {u, v'}, {(s, v), (s, u')}, s) 
if = (0, {1 2 H> v'}, 0, 0, {(s, v', ?;)}) 

- If a = (77, if) : 1 — >■ 0, if = (l, 0, A, r, /r) then def^ a = 



(77', S') : 1 — ^ 0 with 



77' = {Sh U {t}, Vh U {f}. Eh U {(t, u)}, t), for fresh t ^ Sh and v ^ Vh 
S' = ({1 !-)■ u},Q,A,rU{(u, I(l))},/x) 

The ssmc operators are defined in the following way. Consider Oi = {Hi, Si) 
with Si = {li, Oi, Ai, Ti, ^i) and Xi : Zi ^ Vff. , where i G [2]. W.l.o.g. we suppose 
SHi = SH^ = s and {Shi ~ {shA) (^{Sh 2 -{sj/aD = 0, as well as Ai(z) = \ 2 {z), 
Vz G Zi r\ Zi and {Vh^ ~ Ai(.^i H Z^,)) H {Vh^ ~ ^ 2 {Z\ D Z^)) = 0. 



- Identity id)), = (77, if) : m — >■ m is defined by 



77= ({s},{?;*|i G [m]}, {{s,Vi)\i G 
S = ({i h-^ Vi\i G [m]},{i H> Vi\i G [m]}, 0,0,0) 

- Symmetry ^ = (77, if) \ m + n ^ n + m\s defined by 
H = ({s},{ui|i G [m-|-n]},{(s,Ui)|i G [m-|-n]},s) 

S = {{i^ Vi\i G[m + n]}, {i i->- Vm+i\i G [n]} VJ {n + i^ Vi\i G [m]}, 0, 0, 0) 



- Tensorial product ai ® a 2 : m + k ^ n + I of ai : m ^ n and 02 : 7 — ? is 
obtained by combining a\ and 02 as follows. Increment with m the input labels, 
and with n the output labels in 02 . Contract the two roots, as well as vertices 
of Oi and of 02 bearing the same name label. Formally, ai (Z 0,2 = (77, if) where 



H = {Shi U Sh2 , Vhi U Vh2 1 Ehi U Eh 2 , s) 

S = ({li U m © I 2 , 0i U n © O 2 , Ai U A 2 , n U T 2 ,Hi W 742 ) 



- Composition oi • 02 : m — 7 of Ui : m — >■ n and 02 : n — >■ 7 is obtained by 
combining a\ and 02 as follows. Contract the two roots, as well as vertices of oi 
and of 02 bearing the same name label. For each i G [n], contract the vertex of 
oi labeled by (i) with the vertex of 02 labeled by (z). Remove the labels (z) and 
(z). Formally, oi • 02 = where 
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H = {Shi U Sh 2 , Vhi U Vh 2 , Ehi U Eh 2 , s) 

E = ({li, D 2 , Ai U A 2 , Ti U T2, Ail W fi2) 

We define the abstractor. Let a = {H,E) : m ^ n with S = (l, 0, A, r, Ai). 
Then abj a:l + m— ^1 + nis obtained from a as follows. Increment with 1 all 
input and output labels. Assign to the vertex labeled by x both the input label 
(1) and the output label (1). Remove the label x. Formally, ab^ a = 
where S' = ({1 >->■ A(a;)} U 1 0 I, {1 !->■ A(x)} U 1 0 0, A — {a; !->■ A(x)}, r, Ai)- 

It is easy to see that the above introduced operators over nets are well-defined, 
except the abstractor. Indeed, ab^ a is not well-defined if the net a does not 
contain a vertex labeled by x. We need an additional technical adjustment. 

def 

Thus we change the definition of the above introduced operators by op(a, . . .) = 
op^(a 0'*' i, . . .) 0'*' i, where op stands for each of these operators and i = {H, S) 
is the net defined by 

H ^ ({s},{u*|i e N},{(s,u*)|i e N},s) 
r=(0,0,{z, ^u,|isN},0,0). 



Proposition 1. The operators {x), uj, v, out, def, id, p, -, 0 and ab^, define 
a eontrol strueture. 

We define the following two derived control operators: 
out„ ‘^= ((it) 0 idi) • out 
def „ a ‘^= (li) • def a 

The operator def can be generalized. If a : m — >■ 0, then def a : to — ^ 0. The 
corresponding graphical representation is extended in a natural way (we use m 
external arcs to connect the new root hyperarc to the old one). We can define a 
derived control operator def„^ by 

a = ((ill) 0 ... 0 (um)) ■ def a 

The reaction \ of the control structure is the smallest relation over jc-nets 
closed under tensorial product, composition, abstraction, and equality, which 
satisfies the following control rule 

outu 0 def u a a 0 def „ a 
The reaction can be generalized in the following way 

out„j 0 ... 0 outu^ 0 def„j...„^ a \ a 0 def„j...u^ a 

4 Expressiveness of the jc-Nets 

We give the semantics of the join calculus by using our jc-nets. It is known that 
the join calculus has the same expressive power as the ir-calculus |Fou99j . The 
results of this section show that our j c-nets have the same expressive power as 
the join calculus. 

Definition 4. The eneoding of the join ealculus processes using the jc-nets is 
defined by 
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1. [0] = ido 

2. [^(z;)] = {v) ■ out„ 

3. [P\Q] = [P] 0 [Q] 

4- [def u{y) 0 P in Q] = ly ■ ('u)([(5] 0 def^ {y)[P]) 

Lemma 1. fn(P) D surf([P]). 

Lemma 2. [{x/y}P] = [x/?/][P] 

Proposition 2. If P = Q, then we have [P] = [Q]. 

Theorem 1. If P ^ Q, then we have [P] \ [Q]. 

Theorem 2. If [P] \ a, then there is a process Q such that P ^ Q and [Q] = a. 

The encoding of the full join calculus by the jc-nets can be easily obtained in 
the monadic version because the number of the defined channels is equal to the 
number of received channels for each join-pattern. If dn(P) = {zti, . . . ,Uk} and 
J = ui{yi) \ . . . \un{yn), then we define the semantics of the full monadic join 
calculus by: 

[0] = ido [w(i’)] = {v) ■ out„ [D A E] = [D] (g) [E] 

[P\Q] = [P] ® \Q] [j\>p] = defui...ur, {yi ■ ■■yn)[P] 

[def D in P] = {v ■■■ 1^1 n) ■ {ui .. . Uk){[D] (g) [P]) 

It is not necessary to encode the join-patterns because the join-patterns interfere 
just by the defined and received channels. 

5 Example: A Chat System 

In this section we implement a simple chat system using our jc-nets. A chat 
system consists of a server and several clients and it is defined by the following 
property: each message coming at the input of any client must be present at 
the output of all clients. Similar to the mechanism of Unix message queues, we 
use a single channel q to exchange messages between the server and the clients. 
A type is associated with each message to allow the server and the clients to 
multiplex messages onto an unique q. Let us consider a chat system made up 
of a server S and two clients A and B. A channel idS is used to provide the 
type of a message from any client to the server. A channel idA (respectively 
idB) is used to provide the type of a message from the server to the client A 
(respectively B). The client A (respectively B) uses an input channel inA and an 
output channel out A (respectively inB and outB). Considering that a message 
m is sent along the input channel inA, the join calculus program corresponding 
to this chat system is as follows: 

Chat = def q{x) \ idS{y) > q{x) \ idA{_) \ q{x) \ idB{_) A 

q{x) I idA{y) > outA{x) A 

q{x) I idB{y) > outB{x) A 

inA{x) > q{x) \ idS{I) A 

inB{x) t> q{x) \ idS{I) 

in inA(m) 
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Gabriel Ciobanu and Mihai Rotaru 




Fig. 3. The jc-net describing the chat system. 



According to the semantics provided, the jc-net of Fig. El corresponds to the 
above join calculus program. Despite the fact that the only proper labels of 
the jc-net are out A and outB, we use other labels as well in order to give a 
clear description of the chat system. The structure of the described jc-net is 
not changing during the reaction steps. Only the root hyperarc is changing. 
Thus we can visualize the reactions of the described j c-net focusing just on the 
changes of the root hyperarc. 




Fig. 4. The reactions of the j c-net describing the chat system. 



^ Generally, the structure of a jc-net may expand and shrink during reaction, namely 
new hyperarcs may appear by hyperarc duplication or existing hyperarcs can disap- 
pear by hyperarc fusion. 
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It is easy to see in Fig. 0that the described system satisfies the chat condition: 
the message m coming along the input channel inA in the initial j c-net is sent 
to the output channels out A and outB in the final j c-net. 

6 Conclusion and Related Works 

We have introduced and studied a graphical representation for the join calculus 
given by the jc-nets. Why is interesting this new graphical formalism? The jc- 
nets avoid irrelevant aspects of 7r-nets without losing their expressive power. 
Moreover, our formalism is presented by the same algebraic framework used for 
the presentation of the 7r-nets. 

Graphical representations for the 7r-calculus or for other process calculi high- 
lights new perceptions and provide more intuition about concurrency and dis- 
tributed systems. While it has always been usual to draw processes as flow-graphs 
for a more intuitive representation, there are, as far as we know, only few papers 
formalizing the graphical presentations of processes. Two graphical formalisms 
were given by Robin Milner: 7r-nets and action graphs |Milt)4IMilfitij . The action 
graphs are graphical representations for action calculi; being too general, they 
are not able to catch specific features of some particular action calculi. Gabriel 
Giobanu and Mihai Rotaru have studied the faithful 7r-nets |gfM| and tree 
resource automata with their graphical representation Barbara Konig 

has developed a different hypergraph-based formalism for mobility, providing a 
hypergraph rewriting semantics for the 7r-calculus pWE99j . 
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Abstract. We show that, in the case of context-free programmed gram- 
mars with appearance checking working under free derivations, three 
nonterminals are enough to generate every recursively enumerable lan- 
guage. This improves the previously published bound of eight for the 
nonterminal complexity of these grammars. This also yields an improved 
nonterminal complexity bound of four for context-free matrix grammars 
with appearance checking. Moreover, we establish nonterminal complex- 
ity bounds for context-free programmed and matrix grammars working 
under leftmost derivations. 



1 Introduction 

Descriptional complexity (or, more specifically, syntactic complexity) is inter- 
ested in measuring the complexity of describing objects (in our case, formal lan- 
guages) with respect to different syntactic complexity measures. In particular, 
very economical presentations of languages are sought for. For example. Shan- 
non im showed the nowadays classical result that every recursively enumerable 
language can be accepted by some Turing machine with only two states. 

Similar complexity considerations may be carried out for any language de- 
scribing device. In the case of grammars, natural syntactic complexity measures 
are the number of nonterminals and the number of rewriting rules. In this pa- 
per, we will consider the nonterminal complexity of certain regulated grammar 
formalisms which characterize the recursively enumerable languages. In the lit- 
erature, several interesting results on this topic appeared in recent years. For 
example, in the case of scattered context grammars, there has even been some 
sort of race for the smallest possible complexity bound, see mm- Here , we 
will concentrate on the question: how many nonterminals must a context-free 
programmed grammar (working under free derivation) with appearance check- 
ing necessarily have in order to be able to generate every recursively enumerable 
language? Previously, a solution using eight nonterminals has been known ^ 
Theorem 4.2.3]. We improve this bound to three by using a rather intricate Tur- 
ing machine simulation. This is our main result. This result could also be useful 
within the emerging area of membrane computing |S|. In fact, Freund and Paun 
derived an upper bound of four for the nonterminal complexity of programmed 

M. Margenstern and Y. Rogozhin (Eds.): MCU 2001, LNCS 2055, pp. 202-|21^ 2001. 

(c) Springer- Verlag Berlin Heidelberg 2001 
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grammars |H| by using a different simulation technique based on register ma- 
chines. As a corollary, we derive that three nonterminals are enough to generate 
every recursively enumerable language by using context-free programmed gram- 
mars with appearance checking working under leftmost derivations of type 3. 
The same bound was previously claimed for context-free programmed grammars 
without appearance checking working under leftmost derivations of type 2 by 
Meduna and Horvath im Theorem 5] (within Kasai’s formalism of state gram- 
mars [7]). Since we think that the proof given there is incorrect, we give a new 
characterization of the recursively enumerable languages through programmed 
grammars without appearance checking working under leftmost derivations of 
type 2 with four nonterminals based on the construction leading to our main the- 
orem. Similarly, a nonterminal bound of four can be derived for grammars with 
unconditional transfer checking working under leftmost derivations of type 2. 
Our main result also yields an improved nonterminal complexity bound for 
context-free matrix grammars with appearance checking (namely four instead 
of six as previously published in H21, also see Theorem 4.2.3]; independently, 
this bound was achieved recently by Freund and Paun [S|). This bound holds 
for matrix grammars working under free derivations and working under leftmost 
derivations of type 3, as well. Finally, we can derive a first nonterminal complex- 
ity bound for context-free programmed (or matrix) grammars with unconditional 
transfer (working under leftmost derivations of type 3), under the assumption 
that the terminal alphabet is fixed. A long version is available as Technical Re- 
port WSI-2000-26, Wilhelm-Schickard-Institut fiir Informatik, Tubingen. 

We use standard mathematical and formal language notations throughout the 
paper, as they can be found in m In particular: ttj selects the jth component 
of an n-tuple; A denotes the empty word; denotes the reversal of string w. 

2 Turing Machines 

In order to be able to reason more formally, in the following, we give a definition 
of a Turing machine which is adapted to our purposes. The reader can probably 
easily check its equivalence with his or her favourite definition. 

Definition 1. A (nondeterministic) Turing machine (with one one-sided tape) 
is given by M = (Q, E, F,S, go, where Q is the state alphabet, 

S is the input alphabet, F (with S Q F and F C\Q = ^) is the tape alphabet, 
S C Q X F X {L, R} X Q X F is the transition relation, go is the initial state, qf 
is the final state, G T is the left endmarker, ((r € F is the right endmarker 
and ^ € F is the blank symbol. 

A configuration (also ealled instantaneous description) of M is described by 
a word c G with F = F\{#r, #r}. Here c = #Lwqv 

means: The head of the Turing machine is currently scanning the last symbol a of 
)(lW- We now describe possible (and the only possible) successor configurations 
c' of c= fj^Lwqv (given S), written c \~m c' for short: 

F If a a ((r and {q,a, L,q' ,a') G S with a' G F, then for w = w'a, 

c = )(Lw'aqv \~M ((Lw'q'a'v holds. 
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2. If a ^ a ^ ffa and {q,a, R,q' ,a') € S with a' € F, then for w = w'a 
and V = bv' with b € F \ {#l}, c = ff^w'aqbv' \~m ffLwbq'v' holds. 

3 . If a = and ((?, #l, i?, #l) S S, then for w = X and v = bv' with 

b€ r \ {#l}, c = ffhqbv' l-M #Lbq'v' holds. 

4. If a = ffn and {q,ffn,L,q',a') S 5 with a' S F, then for w = w'ffn, 
c = ffLw'ffnq \~M ffhw'q'a'ffu holds. 

As usual, \-*j^ denotes the reflexive transitive hull of the binary relation \~m- The 
language generated by M is given as: 

L{M) = {zn e r* I #LqoflR *LwfqfflR, where ^ € #*}. 

Observe that only the last condition given in the definition of successor con- 
figuration allows for prolongating the working tape. This ability is, of course, es- 
sential for obtaining the power to describe all recursively enumerable languages, 
see also the famous workspace theorem jSj. In the following, TZ£ denotes the 
class of all recursively enumerable languages, which can be characterized as the 
family of languages generatable by Turing machines. 

3 Regulated Grammars 

The notion of a programmed grammar is crucial to this paper. 

Definition 2. A (context-free) programmed grammar (with appearance check- 
ing) is given by a quadruple G = (N,E,P,S), where N is the nonterminal 
alphabet, S is the terminal alphabet, S G N is the start symbol and P is a 
finite set of rules of the form {r : A ^ w,a{r), 4 >{r)) , where r : A ^ w is 
a eontext-free rewriting rule, i.e., A G N and w G {N U if)* (hence, erasing 
rules are permitted), which is labelled by r, and a{r) and 4 >{r) are two sets of 
labels of such context-free rules appearing in P. A ^ w is termed core rule of 
{r : A ^ w,a{r),(p{r)). a{r) is also called success field of r and (f){r) is called 
failure field of r. By A{P), we denote the set of all labels of the rules appearing 
in P. 

For (xi, ri), (x2, ?'2) € (fVUif)* x A{P), we write {x\,ri) (2^2, ^2) iff either 

Xi = yAz, X2 = ywz, (ri : ^ w,a{ri), 4 >{ri)) G P, and r2 G <x{ri) 

or xi = a; 2 , {r\ \ A ^ w,cr{ri),(j){ri)) G P, A does not occur in xi and r2 G 
f>(ri). Let ^ denote the reflexive transitive hull of The language generated 
by G is defined as L(G) = {w € if* | (S,ri) = 4 > (w,r2) for some r\,r2 G A{P)}. 
The language family generated by programmed grammars is denoted by P . The 
language family generated by programmed grammars with at most k nonterminals 
is denoted by 



^ In any case, a number as subscript in the corresponding language class denotation 
will refer to a nonterminal bound for that class. 



Nonterminal Complexity of Programmed Grammars 205 



Theorem 1. TZS = Pg = T’. (see 

Dassow and Paun pose as an open question whether or not that complexity 
bound could be improved. We address this question here. 

In the literature, two variants of programmed grammars are discussed: 

— In a programmed grammar G = {N, E, P, S) without appearance checking, 
for every r G A{P), we have 4>{r) = 0. 

— In a programmed grammar G = {N, E, P, S) with unconditional transfer, for 
every r G d.(P), we have 4>{r) = a{r). 

In several papers, leftmost derivations are introduced, as well. For precise 
definitions of leftmost derivations of types 3 and 2, we refer to m- For better 
distinguishability from leftmost derivations, we will call the derivation relation 
defined above free derivation. Let denote the language family generat- 

able by context-free programmed grammars working under leftmost derivation of 
type 3, i.e., a selected rule (r : A ^ w,a{r),(j)(r)) of a context-free programmed 
grammar is applied to a sentential form a always in a manner choosing the 
leftmost occurrence of T in a for replacement. Let VUT^~^ denote the class 
of languages generatable by context-free programmed grammars with uncondi- 
tional transfer working under leftmost derivations of type 3. Let denote the 
language family generatable by context-free programmed grammars without ap- 
pearance checking working under leftmost derivation of type 2. Such a grammar 
is also known as state grammar mm. Let VUT^ ^ denote the class of languages 
generatable by context-free programmed grammars with unconditional transfer 
working under leftmost derivations of type 2. 

Theorem 2. US = = PUT^-^ = = VUT^-"^ . (see fflu; 

For reasons of space, we do not define context-free matrix grammars formally, 
but refer the reader to p. The language families are denoted by replacing the 
V in the programmed language family denotation with Ai. 

Theorem 3. TZ£ = Me = M = M^~^ = MUT^~^ ■ (see ,11 ) 

4 Main Result 

In this section, we are going to sketch a proof of the following result, thereby 
improving the previously known nonterminal complexity bound considerably: 

Theorem 4. TZ£ = P 3 . 

Due to Theorem^ only the inclusion C has to be shown. 

4.1 Informal Explanations 

We proceed by giving several explanations concerning our construction on a 
rather intuitive level. Of course, we only need to show how to simulate a Turing 
machine generating some language L by a programmed grammar with three 
nonterminals. The three nonterminals of the simulating grammar are: A, B, and 
G. We consider a fixed Turing machine M = {Q, E, P, 5, go, 9/, #l, #fi, #)• 
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Encodings. Given a configuration c S (FUQ)* of M, let (3{c) G {0, 1}* denote 
some binary encoding of c using 7 = |’log2(|G| + |Q|)] many bits per symbol 
from F U Q. Since F contains at least three special symbols, namely, #r 
and and one input symbol and since Q contains at least one symbol, we have 
7 > 3. We interpret strings /3(c) as natural numbers given in binary in a standard 
manner, i.e., 001 for example, would be the three-bit binary representation of the 
number 1. We further assume /3(#) = O'*", /3 (#l) = 0'''“^! and = 10^“^. 

Observe that |/3(c)| = |c|7. Obviously, c can be codified uniquely over the unary 
alphabet {A} by which is not the empty word, because c always starts 

with 

There is a special technique we call “passing over symbols” which we describe 
next. 

Passing over Symbols. How can a tape symbol be “passed over”, e.g., in the 
simulation phase? To this end, consider the following program fragment: 

((p, {(p,2)}, {(p,3)}) 

((p,2) ■. A^C, {(p,l)}, {(p,3)}) 

((p,3) : C -)> H, {(p,3)}, exit) 

Such a program fragment is useful to transfer into with m = [n/2j. If 
n = /3(c), then the rightmost bit of /3(c) is erased. Such a loop is useful in several 
circumstances: 

1. within the simulation loop, in order to pass over symbols which are unin- 
teresting in the simulated step; here, one has to store the skipped symbols 
somehow (to this end, the symbol B is going to be used); 

2. when checking the correctness of the guess on where to actually start the 
simulation of one Turing machine step; 

3. when transforming a codified terminal string, e.g., into w with 

w G r*. 

In each of the three described situations, the two different branches towards 
(p, 3) could be used to test the contents of the currently last bit of the string x 
stored in A^. 

Simulation Loop. Our aim is to give several rules of the programmed grammar 
such that c \~m c' is reflected by (A^^^'^B,p) =5> {A^^^''>B,p'). We assume that 
the Turing machine state q of configuration c is somehow stored in the label p. 

The grammar scans searching for some codification of the subword 

aq, where the assumed input symbol a is guessed nondeterministically. To this 
end, a nondeterministically chosen number of letters is passed over, until the 
guessed subword aq is chosen to be verified. At the end of the verification of the 
correctness of the subword guess (at the chosen position) , a replacement (chosen 
from the possibilities given by S, hence yielding c' from c) is simulated. Finally, 
the intermediate representation has to be converted back into the standard cod- 
ification 
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4.2 A Formal Construction 
Initialization. Consider 

(init : A — >■ simstart(go)i 0) 

as the start rule. Here, simstart is a label set indicating possible starting points 
of the simulation, simstart will be defined formally below. 

Remark 1. There are two intricate differences between programmed and graph- 
controlled grammars: (1) No distinguished start label and (2) no distinguished 
final label are specified in programmed grammars (in contrast to graph-controlled 
grammars). In order to cope with these shortcomings, our simulating program- 
med grammar is designed in a way that the axiom A never directly derives a 
terminal string. Instead, e.g., A — >■ A (as occurring in the explanatory subsec- 
tion O) is replaced by three rules in a sequence: (*) A — >• C, {ii) B ^ B 
(checking for the presence of B) and (Hi) C — >■ A. This sequence can be success- 
fully applied only to sentential forms containing a B. This means that init is the 
only possible starting point. 



Remark 2. One could have avoided such additional complication by considering 
graph-controlled grammars as suggested in |5| as a possible clearer grammatical 
model. In fact, all results of this paper as stated for programmed grammars are 
also valid for this related grammatical mechanism. Freund and Paun indepen- 
dently proved that three nonterminals are enough for characterizing TZ£ with 
graph-controlled grammars. 



Skipping a Symbol. The rules for this task will have the labels (skip, q, i, j), 
with qG<5, l<i <7 and 1 < j < 11. More precisely, we take the following 
rules: 




Here, exit-skip(z) equals {(skip, q,i + ^, 1)} if t < 7 and simstart(q), otherwise. 



Simulation. We must give a separate simulation for each of the four possible 
cases of a Turing machine rule. Fix some rule r = (q, a, A, q', o') G ^ in the 
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following. For reasons of space, we only detail on the first case: consider a € F 
(Recall that F = F \ 1#^,#^}.) and X = L. The other cases can be treated 
similarly. 

Let (3{aq) = f3\ . . . with (5i G {0, 1} and j3{q'a') = f3[ . . . j3'2^ with /?' G {0, 1}. 

The simulation of a Turing step has two sub-phases: firstly, it is checked 
whether aq is codified in the current position (which has been reached by re- 
peated applications of the skip procedure), and then q'a' is generated in its 
place. 

We take the following rules in the checking phase: 



((sim-l,r,i, 1) 
((sim-1, r, i, 2) 
((sim-1, r, i, 3) 
((sim-1, r, i, 4) 
((sim-1, r, i, 5) 
((sim-1, r, i, 6) 
((sim-l,r,z,7) 
((sim-1, r, i, 8) 



A^C, {(sim-l,r, z,2)}, 0) 

A^A, {(sim-l,r, z,3)}, 0) 

C ^ A, {(sim-l,r, z,4)}, 0) 

A^ C, {(sim-l,r,z,5)}, fo,i{(32^-i+i)) 
B ^ B, {(sim-1, r, i, 6)}, 0) 

C -)> A, {(sim-l,r, z,7)}, 0) 

A^ C, {(sim-l,r,z,4)}, /i_i(/32^_i+i)) 
C ^ A, {(sim-1, r, z, 8)}, cont-sim-l(z)) 



Here, for b G {0, 1}, 



f,Ab) 



0, if j yf b; 

{(sim-l,r,z,8)}, if j = b; 



and 



cont-sim-l(z) 



{(sim-1, r, z -I- 1, 1)}, if z < 2y; 
{(sim-1, r, 1, 9)}, if z = 2y. 



Moreover, we take the following rules in the generating phase: 



((sim-1, r, z, 9) : B — >• C'^, {(sim-1, r, z, 9)}, {(sim-1, r, z, 10)}) 

((sim-1, r,z, 10) : C ^ B, {(sim-1, r, z, 10)}, /'(/3')) 

((sim-1, r, z, 11) : H — >■ i?^, exit-sim-l(z), 0) 



Here, 

— / exit-sim-l(z), if 6 = 0; 

{ {(sim-1, r, z, 11)1, if 6 = 1; 

and exit-sim-l(z) equals {(sim-1, r, z -|- 1,9)} if z < 2y and {(return, g', 1)}, oth- 
erwise. In any case, we have 1 < z < 2y and 1 < j < 11. 



Returning to Standard Presentation. The corresponding rules are simply 
obtained by interchanging the roles of A and B in the skipping construction. For 
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q € Q and 1 < j < 10, we take the following rules: 



((return, q, 1) : 
((return, q, 2) : 
((return, q, 3) : 
((return, q, 4) : 
((return, q, 5) : 
((return, q, 6) : 
((return, q, 7) : 
((return, q, 8) : 
((return, q, 9) : 
((return, q, 10) 



B ^ C, {(return, q, 2)}, 0) 

B ^ B, {(return, q, 3)}, {(return, q, 10)}) 
C ^ B, {(return, ( 7 , 4)}, 0) 

A — >• C^, {(return, q, 4)}, {(return, q, 5)}) 
C ^ A, {(return, q, 5)}, {(return, q, 6 )}) 
i? — >■ A, {(return, q, 7)}, {(return, q, 9)}) 
B ^ C, {(return, q, 6 )}, {(return, q, 8 )}) 
A — >• A"^, {(return, q, 9)}, 0) 

C ^ B, {(return, q, 9)}, {(return, q, 1)}) 
C ^ B, simstart(( 7 ), 0) 



Here, 



simstart(( 7 ) = {(skip, g, 1, 1)} 

U {(sim-l,r, 1, 1) | r G (5,7Ti(r) = g,7T3(r) = L} 

U {(sim-2, r, 6, 1, 1) | r G S, 7Ti(r) = q, 7T3(r) = R,bG B\ {#l}} 

U {(sim-3, r, 6, 1, 1) | r G S, 7Ti(r) = g, 7T2(r) = #l, 7T3(r) = R, 
b€R\{#L}} 

U {(sim-4,r, 1, 1) | r G <5,7Ti(r) = g,7T2(r) = = L} 

U {(term, 0,0) | g = g/j. 

(sim-j,. . . ,1,1) is the start rule label for the simulation subprogram of the jth 
form of a Turing machine rule. Observe that only in the case when the final state 
has been reached, the first termination rule may be selected as the next rule to 
be applied after finishing a simulation loop. 



Termination Rules. Firstly, we check in some preparatory steps whether there 
is at least one A and exactly one B in the string. Then, we continue checking for 
the occurrence of at the rightmost position of the simulated Turing tape. 

((term, 0 , 0 ) : H -)> H, {(term, 0 , 1 )}, 0 ) 

((term, 0 , 1 ) : B -)> C, {(term, #a, 0 , 2 )}, 0 ) 

((term, ^r, 0,2) : B ^ B, 0, {(term, #a, 0, 3)}) 

((term, #a, 0 , 3) : C H, {(term, #a, 1 , 1 )}, 0 ) 

Now, let if = 27 U {#l, #i?, #, g/j be the set of symbols admissible in a 
configuration whose tape contains a terminal string. In addition, for a G 27\{^l} 
with P{a) = (ii . . . Pj, Pi G {0, 1}, and for 1 < z < 7 , we have: 



((term 


a 


i, 1 ) 


A 




C, {(term. 


a 


*.2)1, fo,i{Pj- 


-i+l)) 


((term 


a 


z, 2 ) 


B 




B, {(term. 


a 


z,3)}, 0) 




((term 


a 


z,3) 


C 


-)> 


A, {(term. 


a 


*,4)},0) 




((term 


a 


z,4) 


A 


-)> 


C, {(term. 


a 


*, 1 )}, fiAP'f- 


-i+l)) 


((term 


a 


z,5) 


C 


-)> 


A, {(term. 


a 


z,5)}, {(term, 


a,i+ 1 , 1 ) 1 ) 
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Here, for b G {0, 1}, 



fjAb) 



0 , if j ^ b; 

{(term, a, i, 5)}, if j = b. 



Similarly, the first bit is finally checked: 



((term, a, 7 , 1) : A ^ C, {(term, a, 7 , 2)}, /o,i(/3i)) 
((term, a, 7 , 2) : H — >■ B, {(term, a, 7 , 3)}, 0) 
((term, a, 7 , 3) : C — >■ A, {(term, a, 7 , 4)}, 0) 
((term, 0 , 7 , 4) : H C, {(term, a, 7 , 1 )}, /i,j(/3i)) 



Then, different things may happen, depending on which tape symbol has been 
currently read: 



((term, #a, j,5) : C ^ A, {(term, 7 , 5)}, {(term, qj, 1, 1)}) 
((term, g/, 7 , 5) : C ^ A, {(term, g/, 7 , 5)}, T({#h})) 

((term, #, 7 , 5) : C ^ A, {(term, #, 7 , 5)}, T({#fl})) 

((term, a, 7 , 5) : C — >■ H, {(term, o, 7 , 5)}, {(term, o, 7 , 6 )}) 

((term, a, 7 , 6 ) : 5 ^ oB, T({#, #^1), 0) 



where a G S and T{X) = {(term, o, 1, 1) | o G B \ X} for X C S. 

Finally, we check the codification of the leftmost tape symbol, i.e., and 
yield the terminal string if everything was all right up to now. 



((term, 1 , 1 ) : H C, {(term, #l, 1 , 2 )}, 0 ) 

((term, 1, 2) : B A, {(term, #l, 1, 3)}, 0) 

((term, #L, 1,3) : C ^ X, {(term, #l, 1, 4)}, 0) 

((term,#L, 1,4) : H H, 0, {(term, 1, 1)}) 



4.3 Comments on the Correctness of the Construction 

In principle, a configuration c of the simulated Turing machine (which is in 

state g) is codified by at any time before the simulation enters a rule 

from simstart(g). By induction, it can be shown that 

1 . a complete loop entering (skip, g, 1 , 1 ) and heading for some rule from 

simstart(g) \ {(skip,g, 1 , 1 )} (which does not enter a rule from simstart(g) 
in-between) converts a string of the form into a string 

^ w,u G r*, a G r, unless a = and w = X; 

2 . the rules whose labels start with sim-i correctly simulate an application of 
a rule of type i of the Turing machine; 

3. a string of the form is correctly converted into a string 

A^(’"“)B by repeated applications of rules whose labels start with return; 

^ Here, for some x G {0, 1}* is the string of B’s obtained by interpreting the 
binary string lx as a binary number. 
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4. (only) a codified tape of the form c G for some w & S, i.e., 

B can be correctly transformed into the terminal string w, 

5. all sentential forms generatable by the programmed grammar are of the form 
A+{BUC)+ U {AUC)*E*B+ U E*. 

Basically from these considerations together with Remark ^ the correctness 
of the proposed construction may be inferred. 

5 Further Consequences 

5.1 Leftmost Derivations 

A natural variant of context-free programmed grammars with appearance check- 
ing is to consider them working under leftmost derivations of type 3, as was 
already done in the very first paper on programmed grammars US). Since the 
construction in our main theorem can also be viewed in a leftmost fashion, we 
can conclude: 

Corollary 1. TZE = . 

Meduna and Horvath Theorem 5] previously considered the nonterminal 
complexity of context-free programmed grammars without appearance checking 
working under leftmost derivations of type 2 (within Kasai’s formalism of state 
grammars 0). They claimed that, for these grammars, three nonterminals are 
enough to generate every language from TZ£. Unfortunately, the coding trick 
used in m Theorem 5] does not work properly]^ Slight modifications of our 
main construction lead to: 

Theorem 5. TZ£ = 

Similarly, we can show: 

Corollary 2. TZ£ = VUTi~^ . 

5.2 Matrix Grammars 

Similarly, one could consider matrix languages instead of programmed languages. 
Due to P] Lemma 4.1.4], matrix grammars can simulate programmed grammars 
at the expense of one additional nonterminal. Therefore, we can state: 

Corollary 3. TZ£ = M .4 = 

This improves the previously published bound of 6 nonterminals for A4, see 
Freund and Paun obtained a matching result for matrix grammars 

® The reader who wishes to study the proof of Meduna and Horvath should consider 
the possibility that a two-letter sentential form AB codified as 01001 in the simula- 
tion will yield 00101 after simulating the rule B — >■ B (if no other applicable rule is 
in the present “state”). 
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5.3 Unconditional Transfer 

We are now going to bound the nonterminal complexity of context-free pro- 
grammed grammars with unconditional transfer working under leftmost deriva- 
tions of type 3, which were shown to be computationally complete in by an 
intrinsically non-constructive argument. 

In this section, let C{S) denote the restriction of the language class C on 
languages over the alphabet E. 

Recall the notion of division ordering: for u,v € E*, u = ui . . .Un, Ui G E, 
we say that u divides v, written u\v, if u G E*uiE* . . . E*UnE* . u is also called 
a sparse subword of v in this case. The famous Theorem of Higman states that 
every L Q E* has a finite subset L' such that every word in L has a sparse 
subword in L'. If I{u) = {u G if* | u\v} denotes the ideal of u, then Higman’s 
Theorem gives the following presentation of L: L n /(m)). Let us 

call L' a Higman basis of L. This presentation has been one of the ideas for 
showing the computational completeness of More precisely, it is clear 

from our quoted construction that the nonterminal complexity of the constructed 
grammar basically depends on three parameters: (1) the nonterminal complexity 
of the simulated grammar, (2) the size of the alphabet of the language and (3) 
the maximal length of a word in a Higman basis of the language. This is still true 
when thinking about a simulation of 'P^~^ grammars instead of starting from 
type-O-grammars in Kuroda normal form, as we did in We will give details 
of such a construction below. This means that we can consider parameter (1) as 
a constant due to our main theorem. Since we keep (2) fixed by definition in the 
following, we only need to worry about (3). The key observation is therefore: 

Lemma 1. Let E be an alphabet. Then, there is a constant n^x;\ such that every 
recursively enumerable language L C E* possesses a Higman basis L such that 
every word w € L obeys |w| < n\s\. 

A sequence of modifications of the proof of the main theorem yields: 
Theorem 6. VA3c > 0 : VUTi~^{E) = n£{E). 

Admittedly, the dependence on the size of the terminal alphabet in the pre- 
vious theorem appears to be somewhat peculiar and seems to be special to 
programmed languages with unconditional transfer. Note that the construction 
used in ^ entails a dependence on the minimal size of a Higman basis of a 
language. As the example = [J^^iiaj | j > 1} (with minimal Higman ba- 
sis {fli, . . . , a„}) shows, the size of minimal Higman bases will grow arbitrarily 
large with growing terminal alphabet size. It is still an open question whether 
a bound on the nonterminal complexity of context-free programmed grammars 
with unconditional transfer working under leftmost derivation of type 3 can be 
derived without limiting the size of the terminal alphabet. 

Similarly, matrix grammars with unconditional transfer can be considered, 
see 0. 
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6 Concluding Remarks 

We briefly discuss whether our main Theorem 0 and its consequences can be 
further improved. It is known ^ Theorem 4.2.2] that there exists a regular 
language in 7^2 \ ’^i- Hence, the chain Vi <Z V2 Q = TZS poses only one open 
problem. We conjecture that V 2 Q V 3 is strict. Similarly, most other bounds 
derived in this paper are rather sharp. Nonetheless, it would be challenging to 
really prove these sharpness conjectures. 

It would be interesting to know whether a nonterminal complexity bound 
dependent on the size of the nonterminal alphabet can be found for context-free 
random context grammars with appearance checking. Note that an upper bound 
independent of the alphabet size does not exist PJ Example 4.1.1]. 

Acknowledgments. We are grateful for immediate answers of our colleagues 
H. Bordihn and Gh. Paun concerning questions on syntactic complexity and for 
some discussions with R. Freund and F. Stephan. 
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Abstract. We improve the results elaborated in jS] on the number of 
non-terminal symbols needed in matrix grammars, programmed gram- 
mars, and graph-controlled grammars with appearance checking for gen- 
erating arbitrary recursively enumerable languages. Of special interest 
is the result that the number of non-terminal symbols used in the ap- 
pearance checking mode can be restricted to two. In the case of graph 
controlled (and programmed grammars) with appearance checking also 
the number of non-terminal symbols can be reduced to three (and four, 
respectively); in the case of matrix grammars with appearance checking 
we either need four non-terminal symbols with three of them being used 
in the appearance checking mode or else again we only need two non- 
terminal symbols being used in the appearance checking mode, but in 
that case we cannot bound the total number of non-terminal symbols. 



1 Introduction 

The motivation for the present work comes from two directions. On one hand, it 
improves an “old” result in the area of regulated rewriting ( 0 ) j of a rather basic 
nature (minimizing the number of non-terminal symbols necessary in order to 
generate an arbitrary recursively enumerable language by a matrix grammar with 
appearance checking) . It should be emphasized that the number of non-terminal 
symbols is a basic descriptional measure of Chomsky grammars and of their 
extensions. On the other hand, as matrix grammars recently are widely used for 
proving universality results in the area of membrane systems (P systems), see 0, 
and the number of non-terminal symbols used by matrix grammars in the so- 
called appearance checking mode is of crucial importance in obtaining universal 
P-systems with a small number of membranes (see the technique introduced in 
^), this “old” problem becomes of a renewed interest. 

* Work supported by Bundesministerium fur Bildung, Wissenschaft und Kunst, Aus- 
tria, grant GZ 55. 300/269- VII/D/5a/2000 and by a grant of the NATO Science 
Committee, Spain, 2000-2001 

M. Margenstern and Y. Rogozhin (Eds.): MCU 2001, LNCS 2055, pp. 214-|22^ 2001. 
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In 0 it was shown that six non-terminal symbols are enough in matrix 
grammars with appearance checking for generating any arbitrary recursively 
enumerable language; all six variables were used also in the appearance checking 
mode. The purpose of this paper is to improve this result on the number of non- 
terminal symbols, i.e., we shall show that four non-terminal symbols, only three 
of them being used in the appearance checking mode, are sufficient. If we allow 
an arbitrary number of non-terminal symbols, then the number of non-terminals 
being used in the appearance checking mode can even be reduced to two. This 
result also allows us to establish a stronger version of the binary normal form 
for matrix grammars with appearance checking which we shall call the strong 
binary normal form. This special result has already been used in some other 
papers, especially in the area of P-systems, e.g., see P]. For graph-controlled 
and programmed grammars with appearance checking the total number of non- 
terminal symbols can be reduced to three and four, respectively, and only two 
of them need to be used in the appearance checking mode anyway. 

The basic result we use for proving our main theorems has already been 
known since many years, i.e., in jS| it was shown that the actions of a Turing 
machine can be simulated by a register machine with only two registers. The 
actions of such register machines can easily be simulated by graph-controlled 
grammars with appearance checking (and programmed and matrix grammars 
with appearance checking as well); the main difficulty that remains is to generate 
the terminal string and at the same time also the corresponding encoding for 
the simulation of the register machine by having only two non-terminal symbols 
being used in the appearance checking mode. 

Independently, similar results for bounding the total number of non-terminal 
symbols in programmed and matrix grammars with appearance checking have 
been obtained by Henning Fernau in 0 , who uses a tricky Turing machine simu- 
lation for obtaining his main result for programmed grammars with appearance 
checking, where he even needs only three non-terminal symbols, yet all of them 
are used in the appearance checking mode. On the other hand, he has to use all 
four non-terminal symbols in the appearance checking mode in his construction 
for matrix grammars with appearance checking, which result is not optimal for 
the purposes needed in the area of membrane computing (see H)- 

The rest of the paper is organized as follows: In the next section we give 
the definitions for arbitrary grammars and graph-controlled, programmed and 
matrix grammars with appearance checking as well as for register machines. The 
main result of this paper is established in section three, i.e., we show that for 
each recursively enumerable language there exists a graph-controlled grammar 
with appearance checking with only three non-terminal symbols and with only 
two of them being used in the appearance checking mode. In the fourth section 
we establish the results for matrix grammars with appearance checking. We 
conclude with a short summary of the results elaborated in this paper and with 
some aspects of related and future research. 
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2 Definitions 

In this section we first recall some well-known notions from formal language 
theory and then define the main control mechanisms to be considered in this 
paper, i.e., we define graph-controlled, programmed, and matrix grammars with 
appearance checking. Finally we introduce the model of register machines we are 
going to simulate for establishing our main results. 



2.1 Formal Language Prerequisites 

The reader is assumed to be familiar with the basic notions of formal language 
theory, e.g., see and |^. We only give the following definitions: 

For an alphabet V , by V* we denote the free monoid generated by V under 
the operation of concatenation; the empty word is denoted by A, and V~^ := 
V* \ {A}. For any word w € V* and any X G V, represents the number of 
occurrences of the symbol X in w. The set of non-negative integers is denoted 
by INq, and by RE we denote the family of recursively enumerable languages. 



2.2 Grammars with Coutrol Mechanisms 

In this subsection we give the definitions of the main mechanisms of regulated 
rewriting we shall consider in this paper, i.e., we define graph-controlled gram- 
mars, programmed grammars, and matrix grammars with appearance check- 
ing. As in this paper we only deal with such grammars using the appearance 
checking mode, in the following we shall omit this additional information, thus 
only speaking of graph-controlled grammars, programmed grammars, and ma- 
trix grammars, respectively. As is well known (see 0), each of these control 
mechanisms allows us to obtain RE with context-free productions. 



Graph- Coutrolled Grammars. A graph- controlled grammar is a construct 
Gc = {N, T, (R, Lin, Lfin) , S) ; N and T are sets of non-terminal and terminal 
symbols, respectively, with A^flT = if); S G N is the start symbol; i? is a finite set 
of rules r of the form {I (r) : p {I (r)) , a {I (r)) , (p {I (r))), where I (r) G Lab (Gc), 
Lab (Gc) being a set of labels associated (in a one-to-one manner) with the rules 
r in R, p {I (r)) is a context-free production over NUT, a{l{r)) C Lab (Gc) is 
the success field of the rule r, and p{l(r)) is the failure field of the rule r; 
Lin Q LabiGc) is the set of initial labels, and L/m C Lab{Gc) is the set of 
final labels. For r = (l{r) : p {I (r)) , a {I (r)) , p {I (r))) and v,w G {N U T)* we 
define (v, I (r)) =^Gc if only if 

— either p {I (r)) is applicable to v, the result of the application of the produc- 
tion p{l{r)) to V is w, and k G a (I (r)), 

— or p {I (r)) is not applicable to v, w = v, and k G p(l (r)). 
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The language generated by Gc is 

L (Gc) = {w €T* \ {S, lo) =^Gc (wiji) . . . =^Gc (wkjk) , k> I, 

Wj £ {N U T)* and Ij £ Lab (Gq) for 0 < j < k, 

wq = Wk = tn, Iq £ £ Lfin} . 

A non-terminal symbol A G iV is said to be used in the appearance checking 
mode, if there exists at least one rule {I : p,a (1) , p (/)) G R such that p is of the 
form A — >■ a with a £ {N U T)* and p (1) ^ 0. 

Programmed Grammars. A programmed grammar is a construct Gp = 
{N,T, R, S) such that Gc = {N,T, {R, Lab (Gc) , Lab (Gc)) , S) is a graph- 
controlled grammar, where Lab {Gc) is the set of labels associated (in a one- 
to-one manner) with the rules r in R; i.e., in a programmed grammar we do 
not specify initial and final rules. As we shall see later on in this paper, the 
major drawback of programmed grammars in comparison with graph-controlled 
grammars with respect to our goals is the absence of the possibility to specify 
initial rules. 



Matrix Grammars. A matrix grammar is a construct Gm = (fV, T, (M, F ) , S) 
where N and T are sets of non-terminal and terminal symbols, respectively, 
with A^nT = 0, 5'GA^is the start symbol, M is a finite set of matrices, 
M = {mi I 1 < t < n}, where the matrices are sequences of the form = 
ni > 1, \ < i < n, and the mij, 1 < J < n-i, 1 < t < n, are 
context-free productions over N\JT, and F is a subset of Ui<i<n i<j<rH 

For mi = . . . , mi^m) and v,w £ {N U T)* we define v =^mi w if and 

only if there are wq, wi, . . . , Wm £ {N U T) such that wq = v, = w, and for 
each j, 1 < j < Ui, 

— either wj is the result of the application of mij to Wj-i, 

— or mij is not applicable to Wj-i, Wj = Wj-i, and mij G F. 

The language generated by Gm is 

L {Gm) = {w £T* \ S =^mi^ Wi... =^7ru^ Wk, Wk = W, 

Wj £ {N U T)* , TOij £ M for 1 < J < fc, /c > l} . 

A non-terminal symbol A G iV is said to be used in the appearance checking 
mode, if there exists at least one production of the form A ^ a, a £ {N U T)*, 
that appears in F. 



2.3 Register Machines 

In this subsection we introduce the model of register machines that we shall use 
to establish our main result in the next section. 
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The main idea of a register machine is to use registers for storing non-negative 
integers (in a unary representation) and to use very simple operations to change 
the contents of a register; usually these operations are the addition of one to 
and the (conditional) subtraction of one from the contents of a register. With 
these instructions we now can write programs computing functions on natural 
numbers. 

We shall introduce a model which (syntactically) is even more powerful than 
the models used in [^, hence all the results obtained there with those models 
can also be obtained with this model: 

An n-register machine is a construct M = (n,R,i,f) , where n is a natural 
number, i.e., the number of registers the machine may use; i? is a set of labelled 
program instructions of the form k : (op (i) , I, m) such that op (i) is an operation 
on register i of M and k, I, m are labels from a set of labels Lab (M) (Lab (M) 
labels the labelled program instructions of M in a one-to-one manner), k ^ f, 
I is the label for continuing the program if op (i) can be applied to register i 
and m is the label for continuing the program if it is not possible to apply op (i) 
to register *; to the final label f we assign the program instruction end, which 
halts the program of the register machine M; i is the initial label to start the 
program. 

We will use the following program instructions (op,l,m) : 

— (S (i) ,l,m) ii possible (i.e., if the contents of register i is greater than zero) 
subtract one from register i and go to label I, otherwise skip, i.e., do not 
change the contents of register i, and continue with the instruction at label 
m; 

— (A (i) ,h,h) : add one to register i and continue the program with the in- 
struction at label h\ obviously, the operation A (i) is always possible, hence 
both labels where to continue have to be the same. 

In the models used in |^, h as well as one out of {I, m} had to be fc -|- Iwith 
k being the label of the labelled program instruction under consideration; yet 
we shall not impose such restrictions on our model here in this paper, because 
our simulations can deal with these extended features without any problems. 
Moreover, writing programs for a register machine becomes easier without such 
restrictions. 

An n-register machine M now can be used to compute a partial recursive 
function / : INg — INq in the following way: 

M starts with m £ INg in register 1; if M halts in the final label / and 
with the contents of register 1 being r, then we say that M has computed 
/ (m) = r, otherwise, if M does not halt in the final label / when started with 
m in register 1, then / (m) remains undefined. 



As a very simple example to explain our model we construct a 1-register 
machine computing the partial function / : INg — > INg which yields 0 for even 
numbers and remains undefined for odd natural numbers: 
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Example 1. Consider the 1-register machine M = 0, 2) , where 

i?={0:(5(l),l,2),l:(,5(l),0,l),2:end}. 

More intuitively, we can write down M in a program-like style agreeing in 
the fact that 0 is the initial label: 

0: (5(1), 1,2) 

1 : (5(1), 0,1) 

2 : end 

This simple program works as follows: If at label 0 the contents of register 1 
is zero, we jump to the end label 2 and M stops; otherwise we subtract one and 
continue at label 1, where again we try to subtract one; if this is not possible, 
we enter an infinite loop in 1; otherwise we subtract one and return to label 0. It 
now is obvious to see that in fact M computes the partial function / : INq — >■ INq 
which computes 0 for an even number and remains undefined for an odd natural 
number m. 



The main result established in is that the actions of a deterministic Turing 
machine can be simulated by a 2-register machine: 

Let T be an alphabet of (terminal) symbols with card (T) = z — 1 and 
T = {ai|l<j<z— 1}; then every smbol in T can be interpreted as the 
digit i at base z; hence, every word in T* can be encoded as a non-negative 
integer using the function : T* ^ INq defined by gz (A) = 0, gz (ui) = i for 
1 < i < z — 1, and gz (wa) = gz (w) * z + gz (a) for a € T and w € T*. 

Proposition 1. (see m Let L Q T* be a formal language accepted by the 
Turing machine in such a way that, for every w G T* , halts if and only 
if w G L. Then we can construct a 2-register machine Ml such that, for every 
w G T* , if Ml starts with 2®* in its first register, the program terminates if 
and only if Mt halts on w (i.e., if and only if w G L). Moreover, without loss 
of generality, we may assume that if Ml halts on input then the contents 

of both registers is zero. 

3 The Main Result: Simulating Register Machines 
by Graph-Controlled Grammars 

Using the result from 0 that the actions of a (deterministic) Turing machine 
can be simulated by a register machine with only two registers, i.e., based on 
Proposition ^ we are able to establish our main result: 

Theorem 1. For each recursively enumerable language L there exists a graph- 
controlled grammar Gc = ({^, B, C} , T, (R, {i} , {/}) , A ) , with only three non- 
terminal symbols (A,B,C) and with only two of them (B,C) being used in the 
appearance checking mode, which generates L. 
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Proof. We can only give a sketch of the proof, yet we will specify all necessary 
ingredients for the graph-controlled grammar Gq- 

The main steps of a derivation in Gc can be described as follows: 

— Start with the initial rule {i \ A ^ A, a {i) ,%) . In general, we may use rules 
of the form {k ■. A ^ A,a {k) , 0) as connections between different modules 
of the “program” represented by the graph-controlled grammar Gc- 

— We now generate a terminal word w = a\...ak at the beginning of the sen- 

tential form in such a way that, when adding a new symbol at, in parallel its 
encoding is generated according to the formula (wa) = (w) * z + gz (a) 

using productions A — >■ OiA and using the interpretation of the number of 
non-terminal symbols B and G as the contents of the two registers of a 
2-register machine, which finally yields a sentential form 

For example, let T = {a} , hence, z = 2. Then adding one more terminal 
symbol a is accomplished by the following sequence of rules: 

(fci ■. A^ A,{ki + l},%) 

(k\ -p 1 : A — >■ aA, {ki -I- 2} , 0) 

(fci -p 2 : B — A, {fci -|- 3} , {fci -|- 4}) 

{ki + 3-.A^ AGG, {ki -f 2} , 0) 

(fci -|- 4 : C — A, {ki 5} , {/ci -I- 6}) 

(fci -p 5 A — >■ AB, {fci -l- 4} , 0) 

(fci -p 6 A — y AB, {fci -l- 7} , 0) 

(fci -p 7 A ^ A, {k\, ^2} , 0) 

In this special case of z = 2, in the initial rule we simply may take a {i} = 
{fci}, whereas otherwise we have to choose a module for a terminal symbol 
from T in a non-deterministic way. 

— Copy |u|^ to number of symbols A yielding from wAB^’^^^^ : 

{k2- A^ A,{k2 + l},%) 

(^2 P 1 B — >■ A, {^2 -l- 2} , {/c2 T 3}) 

(^2 -p 2 : A — y AA, {/c2 T 1} ) 0) 

(^2 + 3 : A ^ A, {/C3} , 0) 

~ Generate 2®^ from gz (w) in such a way that the sentential form finally is 
wy with \y\g = |?/|^ = 0, |j/|^ = p. For a successful derivation, 

p must equal one (this last non-terminal symbol A will be removed at the 
end of the derivation in Gc); moreover, observe that for p = 0 we cannot 
continue successfully, because at least in the last step fca -I- 8 of this module 
we need one non-terminal symbol A\ 

(fca : A ^ A, {k^ -I- 1} , 0) 

(fcs -p 1 : A ^ AB, {fca -I- 2} , 0) 

{k^ 2 A — y A, -t- 3, /c3 -t- 8} , 0) 

(ks -|- 3 : A — >■ A, {/c3 -|- 4} , 0) 

(^3 + 4 : B —y A, {k^ + 5} , {^3 -I- 6}) 

{k3 + 5:A^ AGG, {^3 + 4} , 0) 

(^3 + 6 : G — >■ A, {fcs -|- 7} , {k^ -P 2}) 

(^3 -p 7 A — y AB, {^3 -l- 6} , 0) 

(^3 -l- 8 : — >■ A, {^4} , 0) 
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— Simulate 2-register machine Ml (the start label for this simulation of Ml in 
Gc is ^ 4 , the stop label of Ml has to be identified with label / — 1 in Gc)' 
k : {S (1) , I, m) simulated by (fc : B — >• A, {^} , {to}) ; 

k : (S' (2) , I, to) simulated by (fc : C — >■ A, (Zj , {to}) ; 
k : (A (1) , I, T) simulated by (fc : A — >• AB, {1} , 0) ; 
k : {A (2) , 1, 1) simulated by (fc : A — >■ AG, {/} , 0) . 

Hence, the number of non-terminal symbols B and C represents the contents 
of register 1 and register 2, respectively, of the 2-register machine Ml- 

— If Ml has reached the final state, we finish with 
(/-l:H^A,{/},0) and (/:H^A,0,0). 

The final sentential form is of the form where p is the number of non- 

terminal symbols A when starting the simulation of the 2-register machine 
Ml- Hence, for p = 1 we simply obtain the terminal word w € L (remember 
that in this case Ml has halted on 2®"=*^“'^). q 



Programmed grammars are just a special variant of graph-controlled gram- 
mars where we may start and end up with any rule. As already mentioned ear- 
lier, the major drawback of programmed grammars in comparison with graph- 
controlled grammars is the absence of the possibility to specify initial rules. 
Hence, for the programmed grammars in the succeeding proof we need one more 
non-terminal symbol just to start the derivation in a correct way: 

Corollary 1. For each recursively enumerable language L there exists a pro- 
grammed grammar with only four non-terminal symbols and with only two of 
them being used in the appearance checking mode which generates L- 

Proof Let Gc = {{A, B, C} , T, (i?, {i} , {/}) , A) be the graph-controlled gram- 
mar constructed in the proof of Theorem ^ with Lab {Gc) being the set of 
labels associated with the rules in R- Now consider the programmed grammar 
Gp = {{A, B ,C , S} ,T, R' , S) with Lab{Gp) = Lab{Gc) U {start} being the 
set of labels associated (in a one-to-one manner) with the rules in R' and start 
being a new label as well as 

R' = RU {{start : S' — >■ A, {i} , 0)} 

This new rule {start : S — >■ A, {i} , 0) now allows us to start correctly every 
derivation in Gp, too. According to the construction given in the proof of Theo- 
rem Q a terminal word w in a derivation in Gc only appears if at the same time 
we also enter the final state, which guarantees that also in a derivation in Gp a 
terminal word only appears if re is in L. 

Hence, we conclude L{Gp) = L {Gc) = L- □ 

It remains as an open question for future investigations whether the intro- 
duction of this special start symbol S could be avoided as claimed by Henning 
Fernau in P), yet without having to use all three of them in the appearance 
checking mode as in the proof of the main theorem in [3| . 
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One of the tricks used in the proof of our main theorem is the final intersec- 
tion with T* . On the other hand, the language generated by a graph-controlled 
grammar could also be defined by only imposing this restriction of distinguishing 
between non-terminal symbols and terminal symbols by just using the fact that 
a terminal label is reached as the only selection criterium. In that case, of course, 
following the proof of Theorem ^ we also have to check that no non-terminal 
symbol A is left before entering the terminal label. For doing this, we simply 
may replace the rule (/ : B — >■ A, 0, 0) by (/ : A — >• A, 0, {/ -I- 1}) , which now also 
uses the non-terminal symbol A in the appearance checking mode, and take f+1 
as the new terminal label instead of /, where (/ -I- 1 : i? — t A, 0, 0). In that way 
we obtain the following result: 

Corollary 2. For each recursively enumerable language L there exists a graph- 
controlled grammar Gc = {{A, B^C} (R, {i} , {f 1}) , A) with only three 
non-terminal symbols, all of them being used in the appearance checking mode, 
which generates L in such a way that the final label f -\- 1 is reached if and only 
if the current sentential form is a terminal word (and therefore in L). 

4 Normal Forms for Matrix Grammars 

In this section we establish the results for matrix grammars. We first recall some 
interesting results from |2j: 

A matrix grammar G = (N, T, (M, F ) , S) is said to be in the binary normal 
form if N = N 1 UN 2 U {S', #}, with these three sets being mutually disjoint, and 
the matrices in M are of one of the following forms: 

1. (S ^ XA), with A e fVi, A G N 2 , 

2. (X Y,A^a), with X,Y gNi, A € N 2 , a G {N 2 UT)*, 

3. {X ^Y, A ^ with X,Y G Ni, Ag N 2 , 

4. (X — >■ A, A — >■ a), with X G N\, A G N 2 , and a G T* . 

Moreover, there is only one matrix of type 1 and F consists exactly of all rules 
A ^ ff appearing in matrices of type 3; # is a trap-symbol; once introduced, 
it is never removed. A matrix of type 4 is used only once, in the last step of a 
derivation. 

According to Lemma 1.3.7 in |2|, for each matrix grammar G there is an 
equivalent matrix grammar G' in the binary normal form. 

For an arbitrary matrix grammar G = {N,T, {M, F) , S), let us denote by 
ac{G) the cardinality of the set (A GiV|A— >-aGF}. From the construction 
in the proof of Lemma 1.3.7 in 0 one can see that if we start from a matrix 
grammar G and we get the grammar G' in the binary normal form, then ac(G') = 
ac(G). 

A matrix grammar G = {N, T, (M, F) , S) is said to be in strong binary 
normal form, UN = Ni U N 2 U (S', #}, with these three sets being mutually 
disjoint, and the matrices in M are of one of the following forms: 
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1. {S XA ) , with X eNi, Ae N 2 , 

2. {X ^Y,A^ a) , with X,Y gNi, A G N 2 , a G {N 2 UT)*, 

3. {X ^Y,A^ #) , with X,Y G Ni, A G N 2 , 

4. {X — >■ A), with X G Ni- 

Moreover, there is only one matrix of type 1 and only one of type 4 and -F 
consists exactly of all rules A ^ ^ appearing in matrices of type 3, where # 
is the trap-symbol (i.e., once introduced, it can never be removed again). The 
matrix of type 4 is used only once, in the last step of a derivation. Finally, as 
the most important feature, we have ac{G) < 2. 

The strong normal form theorem for matrix grammars now again directly 
follows from Theorem D 

Theorem 2. For each recursively enumerable language L there exists a matrix 
grammar in the strong binary normal form that generates L. 

Proof. Let Gc = {{A, B, C} , T, {R, {i} , {/}) , A) be the graph-controlled gram- 
mar constructed in the proof of Theorem □ with Lab{Gc) being the set of 
labels associated with the rules in R. Now consider the matrix grammar Gm = 
{N, T, (M, F) , 5) with N = NiUN 2 U {5, #}, Ni = Lab (Gc) , N 2 = {A, B, C} , 
F = {B— 7>^,C— >■#}, and M consisting of the following matrices: 

- (S' ^ iA ) ; 

— {X ^ Y, D ^ a) , for {X : D ^ a, a (X) , (f (X)) G R and Y G a (X) with 
X,Y gNi,DG N 2 , a G (X 2 UT)*; 

— {X ~^Y,D^ff), for {X : D ^ a, a (X) , tp (X)) G R and Y G ip (X) with 
X,Y GNi,DG N 2 , a G (X 2 U T)*; 

- (/ ^ A) . 

By construction, Gm is in the strong binary normal form, and obviously simu- 
lates the graph-controlled grammar constructed for L in the proof of Theorem □ 
these observations complete the proof. q 

We should like to mention that the final matrix in a matrix grammar in the 
strong binary normal form could also be of the form (X — >■ A, A — >■ a) with 
X G Xi, A G X 2 , and a G T* like for the binary normal form as defined in 
P). This can easily be achieved due to the construction of the graph-controlled 
grammar in the proof of Theorem ^ because in a successful derivation leading 
to a terminal word a non-terminal A is deleted in the last step leading to the 
terminal label. Hence, if /' is the label of this rule (/' : A — >■ A, {/} , 0) , then 
we simply take (/' — A, A — )> A) as the final matrix in this variant of a strong 
binary normal form for matrix grammars. 

Finally, we can restrict the length of the word a in a matrix (X — >■ X, D — ^ a), 
with X, X G Ni, D G N 2 , and a G (X 2 LIT)* in a matrix grammar in the strong 
binary normal form to two as elaborated in for the binary normal form; we 
leave a proof for this variant to the interested reader. 
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If we also want to bound the total number of non-terminal symbols in a 
matrix grammar, we have to pay the price for that by using one more non- 
terminal symbol in the appearance checking mode: We may use a special non- 
terminal symbol D to encode the elements of the label alphabet iVi by suitable 
powers of D, thus taking advantage of the technique already used in to control 
the sequence of derivation steps in the matrix grammar. Moreover, if g — 1 is the 
maximal power for encoding the labels in Ni , then during a successful derivation 
leading to a terminal word, at no time more than g — 1 symbols can be present 
in a sentential form of such a derivation; therefore can be used as a kind of 
“trap symbol” as exhibited in |^. Hence, we immediately obtain the following 
result : 

Theorem 3. For each recursively enumerable language L there exists a matrix 
grammar Gm = B,C, D} ,T, (M, F) , B) with card{F) < 3 that generates 
L. 

Proof. Let Gc = {{A, B, G} , T, (i?, {i} , {/}) , A) be the graph-controlled gram- 
mar constructed in the proof of Theorem Q where without loss of generality we 
may assume Lab {Gq) = {j | 1 < J < — 1} as well as z = 1 and f = g — 1. 

Now denote a sequence of m equal productions p in a matrix by (p) . Then, 

from Gc we construct the matrix grammar Gm = {AI' VJ {D} ,T,{M,F) ,B) 
with N' = {A, B, G} , F = {X — >■ \ X G {B, G, D}} , and M containing the 

following matrices: 

— (£)—;> Z?®, B — DA) is the start matrix; 

- (^{D ^ X)\D ^ D3,A^ AD\X ^ aj for (i : X ^ a, cr (f) , p (f)) G R 
with X G ZV', a G {N' U T)* , and j G a (i ) ; 

- [{D ^ X)\D ^ D\A ^ AD^ ,X ^ for (i : X ^ a, cr (z) , p (z)) G R 
with X G {B, G} , a G (X' U T)* , and j G ip (z) ; 

— — >■ , D — >■ is the final matrix. 

The start matrix has to be applied as the first matrix in a successful deriva- 
tion; a rule D — D® must not be applied in a derivation that should lead to a 
terminal word in L, because in every matrix at most g — 1 symbols D can be 
removed, before the generation of g symbols D would be enforced again. As at 
least one copy of the non-terminal symbol A is present in each sentential form 
of a derivation of a word w G L in the graph-controlled grammar Gc except for 
the first and the last one, it is guaranteed that one of these occurrences of A can 
be used for the productions A — >• AD^ , which allows us to simulate Gc by Gm 
in a correct way. 

Finally, after the application of the final matrix removing the control symbol 
D, the initial matrix cannot be applied again, because after the application of 
the terminal matrix no non-terminal symbol B can be present any more in the 
underlying sentential form. This observation completes the proof. q 
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5 Conclusion 

We have shown that all recursively enumerable languages can be generated by 
matrix grammars, programmed grammars, and graph-controlled grammars, re- 
spectively, with appearance checking with only two non-terminal symbols being 
used in the appearance checking mode. If we also want to bound the total num- 
ber of non-terminal symbols, then we have to pay off by needing one more non- 
terminal symbol to be used in the appearing checking mode in the case of matrix 
grammars. The total number of non-terminal symbols can even be restricted to 
three in graph-controlled grammars, but only to four in the case of programmed 
grammars and matrix grammars; whether this bound can be reduced to three 
again for programmed grammars as shown by Henning Fernau in |3j without 
having to use all three of them in the appearance checking mode, remains for 
future research. Moreover, in a similar way as done by Henning Fernau in |3|, 
all our results can be carried over from graph-controlled, programmed, and ma- 
trix grammars with appearance checking working under free derivations to the 
corresponding types of grammars working under leftmost derivations. 
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Abstract. A direct universal extended H system receives as input the 
coding of an extended H system with double splicing and simulates it. It 
is the first time that a direct construction is described: universal results 
obtained until now were based on the simulation of universal type-0 
grammars or Turing machines. 



1 Introduction 

DNA based computation, generally speaking, considers the transformation of bi- 
ological molecules as computational steps. Recently, many different computabil- 
ity models have been proposed under the inspiration of such biological processes. 
Splicing systems (see m, C2)) are a generative mechanism based on the splicing 
operation as a model of DNA recombination. If these systems have finite sets 
of axioms and rules defining splicing they can generate regular languages (see 
details in dSl). Keeping both sets finite the generative power can only be in- 
creased by introducing control systems. In this way splicing systems can generate 
recursive enumerable languages. 

A computability model C is computationally complete if the devices in C have 
the power of Turing machines (or of any other type of equivalent device), that 
is they generate and/or recognize recursive enumerable languages. 

There are several systems based on splicing that are computationally com- 
pete. For all control systems the demonstration is based on the simulation of 
type-0 grammars or Turing machines (see 0 and P] for permitting and for- 
bidding context; m for target languages, m for programmed and evolving H 
systems, HU for double splicing, |3| and m for multisets). 

Another property that C may have is universality. This is related to the exis- 
tence of a fixed element in C which is able to simulate any other given device in 
C. Such an element is called universal. The first description of a universal Tur- 
ing machine was given by Turing himself in m- Simplified models of universal 
Turing machine can be found in HU and m- 

Any model C that is computationally complete has a universal device. In fact, 
this is the device that is able to simulate the universal Turing machine, which, 
in turn, may simulate any device from C. 

It is clear that, in the above sentence, splicing systems are universal (explicit 
constructions were made in |S|: lU and PI). However this fact is obtained 

M. Margenstern and Y. Rogozhin (Eds.): MCU 2001, LNCS 2055, pp. 226-|2^^ 2001. 

(c) Springer- Verlag Berlin Heidelberg 2001 
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with an indirect construction: the universal splicing system receives as input a 
coding of a Turing machine (for instance) and if this machine is universal, also 
the splicing system will be universal. In this way, if the input of the simulated 
universal Turing machine is the coding of a splicing system, we will have a system 
based on splicing simulating another one. 

If a splicing system receives as input the coding of another extended H system 
and simulates it we will have a direct construction. 

So the difference between indirect and direct construction relies on the na- 
ture of the input: if it is the coding of a splicing system we will have a direct 
construction, if it is the coding of another kind of computability models we will 
have indirect construction. 

The description of a ‘small’ direct universal extended H system was indicated 
as research topic in US). In this paper we describe a direct construction of a 
universal splicing system and we will see how its size is smaller than the ones of 
the indirect universal extended H system. 

2 An Overview of Splicing 

We give definitions strictly related to our work; more general information may 
be found in US). Consider an alphabet V and two special symbols, # and $ 
not in V. A splicing rule is a string of the form r = ui#U2$U3#U4, where 
ui,U2,U3,U4 G V*] U1U2 and U3U4 are called sites of the splicing rule. For such 
a splicing rule r and strings x, y,z,w € V* we write: 



(a;, y) (z, w) iff x = X1U1U2X2, y = yiU3U4y2, 

Z = XiUiU4y2,W = yiU3U2X2, 

for some xi,X2,yi,y2 G V* , 

indicating that x and y splice according to r giving z and w. 

Based on this operation, the notion of an H scheme is defined as a pair 
cr = (U, R), where V is an alphabet and R C V*4i^V*%V*4i^V* is a set of splicing 
rules. For an H scheme and a language L C V* we define 

a{L) = {z G U* I (x,y) \~r {z,w) or (x,y) \~r {w,z), 

for some x,y G L,r G R,w £ V*}, 

a^{L) = L, 

= a\L) U a{a\L)), i > 0, 
a*{L) = \Ja\L). 

i>0 

If we consider two families of languages FLi and FL2, we define the family 
of splicing languages with FLi axioms and FL2 rules, as: 



H{FLi,FL2) = I L G FLi and cr = {V,R),RG FL2}. 
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We denote by FIN, REG the families of finite and of regular languages 
respectively. We have (see details in M) 

FIN c H{FIN, FIN) c REG. 

An extended H system is a construct 7 = (V, T, A, R), where a = (V, R) is an 
H scheme and T is an alphabets so that T CV {T is called terminal alphabet), 
A is a language over V {A is the set of axioms) . The language generated by 7 is 

l(7) = cr*(A) nr*. 

If both sets A and R are finite, then ^(7) is regular. 

If we want an H system having A and R finite generating more than regular 
languages, then a control system has to be added to the splicing operation. There 
are many classes of controlled systems and most of them characterize the family 
of recursively enumerable languages (see ^ and HSl). 

The type of control we choose in order to implement our construction is 
called double splicing. An extended H system with double splicing (EHSds) is 
defined as extended H system where the two strings obtained from a splicing are 
immediately used in another one. 

Formally, if we consider an extended H system 7 = (V, T, A, R) where x, y, z, 
w € V* and ri,r2 € R we write: 

(x, y) \~n,r2 iff v &V* \ (x, y) hn (u, v) and (u, v) \~r^ {w, z) 

For a language L C V* we define 

aa{L) = {z gV* \ (x,y) \~r^,r2 {z,w) or (x,y) \~r^^r2 {w,z), 

for x,y G L, ri , T2 G R,w G V*}, 

The definition of crj is similar to the one of a* given above. We associate to 
7 the language Ld(j) = cr’^iA) n T*. 

Given two families of languages FLi and FL2 we denote with EHd{FLi,FL2) 
the family of languages generated as above by extended H systems with double 
splicing 7 = (y, T, A, R) so that A G FLi and R G FL2. EHd{[k], FL2) indicates 
the family of languages generated by extended H systems with double splicing 
having A G EIN, card{A) = k and R G EL2 {card{A) is the number of elements 
of A). 

3 Overview of the Construction 

A direct universal extended H system has its own splicing rules and axioms 
and additionally receives as input other axioms defining the system to simulate. 
These input axioms define both the splicing rules and the axioms of the simulated 
system. Applying its own splicing rules to its axioms and to the ones received as 
input the universal system simulates the input system. The language obtained 
by the universal system can be a code of the one of the simulated one. 
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When we began to think to which system to use to implement the idea of 
simulation of splicing we had, we analyzed all kinds of extended H system. Obvi- 
ously all of them share the operation of splicing but some add to this operation 
another one that is the control system itself. For instance in an extended H sys- 
tem with permitted context (see m for details) the presence of the permitted 
string(s) related to each rule has to be checked. 

The advantage of the double splicing is that the control is obtained without 
introducing auxiliary operations - one merely has to keep track of the two steps 
cycle during computation. 

For this simplicity we have chosen double splicing as our control mechanism. 
Moreover the basic operation, rotation of strings, is easy to implement with it. 



First splicing 

1.0 Creation of pairs of axioms 

1.1 Rotation of pairs of axioms 

1.2 Rotation of substrings 

1.3 Join rule-pair of axioms 

(a working string is obtained) 

1.4 Rotation of a working string 

1.5 Matching of the first splicing site 

1.6 Marking of a splicing site 

1.7 Wrong match rule-pair of axioms 

1.8 End of matching. Ordering 

1.9 Matching of the second splicing site 

1.10 Marking of a splicing site 

1.11 Wrong match rule-pair of axioms 

1.12 End of matching. Ordering 

1.13 Splicing 



Second splicing 

2.1 Rotation of pairs of axioms 

2.2 Rotation of substrings 

2.3 Join rule-pair of axioms 

(a working string is obtained) 

2.4 Rotation of a working string 

2.5 Matching of the first splicing site 

2.6 Marking of a splicing site 

2.7 Wrong match rule-pair of axioms 

2.8 End of matching. Ordering 

2.9 Matching of the second splicing site 

2.10 Marking of a splicing site 

2.11 Wrong match rule-pair of axioms 

2.12 End of matching. Ordering 

2.13 Splicing 

2.14 End simulation 



Fig. 1. Structure of the universal system 



Most part of the work of the universal system will be done at the ends of 
strings. These ends have normally the form {head and tail) where (3 defines 
the state of the string. Symbols s (with subscripts) present in strings, are used 
to separate different logical parts of a string. 

Moreover to rotate a string will mean to move symbols between h and t 
(with whatever subscript) from left to right or from right to left. In this context 
the symbols s are important to keep track of the begin and the end of the ro- 
tated substring. For instance the clockwise rotation of two symbols of the string 
hsbViV2V3Set will bring to hv^SeSbV\V2t- Note how, even when rotated, the “sub- 
string” ViV2V^ from Sb to Sg is unchanged. 

Basic operation. Rotation of a string (together with substitution, deletion or 
insertion of one or two rotated symbols) is the basic step in the system we de- 
scribe below. It is performed using a single double splicing. Here we give an 
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example for anti-clockwise rotation with substitution of a symbol. The general- 
ization to the other cases is easy and left to the reader. 

Let us consider the string hiabsicti which we want to rotate anti-clockwise 
by one symbol, changing a into A. In this way we obtain hibsicAti. To do this 
we use the string hiAt\ present as an axiom in the system. 

The operation is performed by one double splicing presented below where 
both splicing sites are indicated by vertical bars (|): 

hia I bsicti hibsic \ ti hibsicAti 
hi I At\ b I h\(x I Ati b 2 h\(xti 

Firstly the rule 1 : is used and h\a is replaced with h\. 

Now the second splicing operation has to be performed between the two strings 
obtained by the previous splicing. The rule 2 = changes t\ into 

All obtaining the string we wanted. 

The universal system simulates a double splicing of the system given as input 
by repeating several similar operations, see Figure 1. Strings will have different 
states according to the basic steps they are involved in. Now we to give an 
overview of the simulation algorithm indicating the different states of strings 
during all the process. 



Single axioms. Axioms of the simulated system will be coded in the universal 
system by strings of the type: 



hi b siti 



Dots present in the box indicate the code of the axiom of the simulated sys- 
tems. The other symbols are used by the universal system for internal purposes. 
The symbol b indicates the begin of the code of an axiom (its use will be clear 
later when this kind of strings are rotated); s’s (with any subscript) are used as 
separators of logical parts of a string when they are joined. 



Rules. Also splicing rules of the simulated system are translated into strings in 
the universal one: 



sitei site 2 



/ig 




# 




$ 




# 





sitei and site 2 indicate the two splicing sites in a rule. Each symbol a is repre- 
sented in the rule by its barred copy a to distinguish it from the symbols in the 
axioms. 



1.0 Creation of pairs of axioms. The simulation of a double splicing begins 
by joining in a non deterministic way two strings representing axioms of the 
simulated system. The first splicing is performed on this pair of strings and the 
second on the result of the first splicing. 
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The indications axiomi and axiom 2 will be used from now on to keep track of 
the two axioms present in a pair. 

1.1 Rotation of pairs of axioms. As said before, rotation is the basic oper- 
ation in the universal system. The next picture illustrates how the rightmost si 
is moved to the left and then, with the iteration of the process, how the order 
of the two axioms can be interchanged. In this way such strings can have any of 
their circular permutation and this is important when a rule is joined to them 
(paragraph 1.3). 

axiomi axiom 2 
hs b Si b Sits 

axiomi axiom 2 




1.2 Rotation of substrings. A substring can be rotated with respect to the 
logical part of the string it occupies. In the case of a pair of axioms, each of the 
axioms is such a logical part and forms a substring delimited by Si’s. 

The next picture illustrates the clockwise rotation of a symbol f3 in axiom 2 - 
Now it is possible to understand the presence of b and si in a pair of axioms. If 
both symbols were not present it would be impossible to read the two axioms in 
the proper way after rotation. 

axiomi axiom 2 axiomi axiom 2 

hs b Si b aj3 sits ^ hs b si Pb a sits 

The rotation of a substring is made in a non deterministic way. Thanks to 
this process the representation of a pair of axioms can have its two axioms in any 
possible circular permutation, this is important when a rule is joined to them 
(next paragraph). 

1.3 Join a rule to a pair of axioms (a working string is obtained). In 

a non deterministic way a string representing a rule can be joined with a pair 
of axioms as to form a working string. It is on this kind of strings that the 
simulation of a double splicing is performed. 
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axiom2 axiomi 



h. ...b... 



Si 



...b... 



Sits 



sitei site2 



axiom2 axiomi sitei 



sitc2 



'■10 



...b... 


Si 


...b... 




# 




$ 




# 





Note that only one Si is present in the working string, the other one has been 
removed during the join operation. Axioms and splicing sites are adjacent: no 
separator is present between them. 



1.4 Rotation of a working string. As for a pair of axioms, a working string 
can be rotated. This is important to match splicing sites with axioms: particular 
groups of symbols present at the two ends of a working string are used in order 
to simulate the splicing. 



no 



axiom2 

1 . ..&... 



Si 



axiomi sitei site2 

#Q$Q#Qtio 



...b... 






axiom2 axiomi sitei site2 



1.5 Matching of the first splicing site. The symbols of the rule are matched, 
one by one, with the symbols in the axiom at the splicing site. Rotating a working 
string it is possible to move aa close to the head of the string (we describe only 
this case, it also possible to have aa close to the tail). In this case (showed 
in the next picture) the matching axiom is rotated by one symbol (a) and the 
barred symbol in the splicing site is removed. When this operation is completed 
the working string is ready to match the next symbols (in the example [3 and (3 
respectively). 

sitei site2 axiom2 axiomi 



hioa ap... # 



# 



...b... si---b...Ptio 



sitei site2 axiom2 axiomi 



p... # 




Si ...b...P tai 


axiomi 


ii 

sitei site2 


axiom2 


...b...p 


■ ~p... # ... $ ... 


• ...b... Sitai 


axiomi 


sitei sitc2 


axiom2 


...b...p 


-p... 


...b... si^tio 



This operation is performed through different steps. When a couple aa is 
present both symbols are removed and the state of the working string in changed 
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into ai,i = 1,2, 3, 4 (the subscript of a is used to refer to the matching that is 
going on: 1 or 2 for the the first or second match of the first splicing, 3 or 4 for 
the first and second match of the second splicing). 

Then the working string is rotated clockwise until si is present at one end. 
At this point a is added on the right of si and the state of the string in changed 
back into 10. The process of matching continues rotating the working string. 



1.6 Marking a splicing site. When the first part of the rule is matched, a# 
is met on the left side of the working string (we can also have on the right 
side and the process is similar to the one that we describe). At this point # 
remains in the checked axiom to keep track of the position where the splicing 
has to occur, the operation of matching goes on with the rest of the splicing site 
(if present). Anyhow the # has to be moved in respect to the checked axiom so 
to permit the continuation of the matching. This is done removing ^ from the 
splicing site {sitei in our example) and changing the state into = 1,2, 3, 4 
(1 or 2 for the first or second match of the first splicing, 3 or 4 for the first or 
second match of the second splicing). Then the string is rotated clockwise until 
Si is close to the tail. At this point # is added on the right of Si and the state 
of the string is changed back into 10. The process of matching can go on. 



sitei sitc2 axiom2 axionii 



^10 01 ^ ■■■ 





$ 




# 






Si 



''10 



11 

sitei sitc2 axiom2 axiorrii 








$ 




# 




...b... 


Si 



1 



axiomi sitei site2 axiom2 



^#1 



...b... 




$ 




# 




...b... 



Sl^#i 






axiomi sitei site2 



axiom2 



...b... 



SiffFt 



10 



1.7 Wrong match rule-pair of axioms. Of course it is also possible to have 
a wrong match between a rule and a pair of axioms. This is detected by the 
presence of af 3 where a yf /3. In this case the working string changes state into 
11 and it cannot take part in any further splicing. 



sitei site2 axiom2 axiomi 
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1.8 End of matching. Ordering. When the match between axiomi and sitei 
is completed the working string will have $ on its extreme left (as all symbols 
present in the splicing site have been consumed). The operation to perform now 
is ordering, it means to rotate the just matched axiom so to have the b on its 
extreme right (extreme left for the second match). This is necessary for the 
implementation of the splicing operation. The ordering is made in a way similar 
to the rotation of substrings (paragraph 1.2). 



site 2 axiom 2 axiomi 




axiom2 



axiomi 




The matching of the second splicing site for the first splicing is symmetric 
to the first matching just described. Encountered situations are similar but mir- 
rored: what present on the left for the first match is present on the right for the 
second and vice versa, if the rotation is clockwise for the first match it will be 
anti-clockwise for the second and vice versa. 



1.13 Splicing. If the second splicing site matches the other axiom, after order- 
ing, the working string is ready to simulate the first splicing. The situation will 
be similar to the one indicated in the next picture. The splicing is simulated in a 
really simple way: substituting and removing symbols, b's are removed, si’s are 
changed into 6’s and #’s into si’s. The obtained string indicates the result of 
the simulation of the first splicing written with the syntax used until now: si’s 
separate the two substrings and b's indicate their begin. 

This obtained string is similar to the pair of axioms created in the paragraph 
1.0, so the operations performed for the second splicing are similar to the ones 
for the first one. Only the states of the working string will be different. 



axiom 2 axiomi 
Si 

■ ■■b... Si [--b tig 





2.14 End simulation. After the second splicing the two substrings are sepa- 
rated in two independent strings in state 1. A double splicing has been performed 
and the two resulting strings can be involved in a new one. 
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4 Inside the Algorithm 

In order to define a universal EHSds we need to consider the coding of an ar- 
bitrary EHSds as input for the universal system. As the universal EHSds has a 
fixed, finite, alphabet, our coding scheme must allow the encoding of arbitrary 
alphabets. Additionally, the coding transforms both axioms and rules of a sys- 
tem into additional axioms for the universal system. The coding must be “fair” 
i.e., it should be simple and not solve computational problems beforehand. 

Let 7„ = (U„, T„, A„, i?„) and 7 = (V,T,A,R) be EHSds’s. Moreover con- 
sider the functions: 

ca-.v*^ v:, 

Cr : V*#V*%V*#V* V* 

]~) . V T”'* 

^ ^ U ' ^ 

where Ca transforms axiom in A into axioms in Cr transforms rules of R 
into axioms in A„ and D transforms final strings of the universal system into 
final strings of the simulated one. 

The system 7„ is called universal if it generates the language of 7, when we 
add the axioms and rules of 7 to 7„ (under a suitable coding). 

Formally, 7„ is universal iff, for each EHSds 7, D(L('ju('y))) = L(j) where 
7«(7) = (K,T„,A„UCa(A) 

Functions Ca,Cr and D. We say that a string g € UJ' is in state li A q = 
hi-^^...hi^etij...ti-^ while no hi and/or ti is present in e. Moreover e will be called 
the information of the string q. 

With input-axioms we refer to the elemets of Ca(A) and to their information; 
with input-rules we refer to the elements of Cr{R) and to their information. 
The contexts will help the reader to understand in which meaning we use input- 
axioms and input-rules. 

We will encode any arbitrary alphabet into the alphabet W = {0, 1, 1}, as fol- 
lows. Assume that the input system has V = {v\,V2, ■■■, v„}, T = {vi,V2, ■■■, Vm} 
with m < n, then the input coding / : U* — >■ W* is the morphism defined as 
rf \ ( iO^ Vu = Vj €V,j<rn 

^ [ 10^ Vv = VjeV,m-hl<j<n 

Now axioms are Ca(x) = hib f{x) siti,Vx € A. 

An input-axiom is always in state 1 and the information it contains has the 
form: b{code), where {code) represents one axiom of the system to simulate. The 
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b indicates where the information of the axiom begins. This is important as 
input- axioms are rotated. 

Let us define the set W = {a \ a G W} and the homomorphism bar : 
(VF U {#, $})* ^ (fT U {#, $})* by bar (a) = a. 

Moreover let the function mirror image be mi : (W U {#})* — >■ {W U {#})* 
so that if a = a\...ap is in {W U {#})*, then mi{a) = ap...a\. 

If a; = is a rule present in the input system, then it will be 

added as axiom to the universal system, coded as follows: 

Cr{x) = h3h2Sibar{mi{f{ui)#f{u2))%mi{f{u3)#f{u4:)))t2t3 
Observe that input-axioms and input-rules are written in opposite way. For exam- 
ple, if we consider the string uiUiU 2 U 3 U 4 and the splicing rule vi^V 2 V 3 $V 2 #V 3 Vi 
their codings, according to the above, is: 

/ii61010100100010000ti and /i3/i2Si000l00l#0l$0l000l#00lt2t3 

Let us consider the splicing site 000100101 present on the left of $ in the 
input-rule. We may see that it is the mirror image with barred characters of 
the substring 101001000 present in the input-axiom. This is important for the 
operation of matching between a input-rule and a input-axiom performed by the 
system. 

The decoding function D is defined as: 

D(hivbwsiti) = f~^{wv), 

where / is the homomorphism for the encoding of the alphabets used above. 

As the language generated by the universal system consists of strings in state 
1, with a marker b indicating the start of the rotated string, the decoding is easily 
understood. 



5 The Direct Universal System and Its Complexity 

Theorem 1 The system = (U, i?„) of type {FIN, FIN) described 

in the previous section is universal for the class of EHSds: if the set of axioms 
Ca(A) U Cr{R) related to a EHSds 7 = {V,T, A, R) is included in 7 „ it will 
produce L( 7 „( 7 )) so that D{L{'ju{l))) = ^( 7 ), where D is the decoding function 
defined in the previous section. 

Proof: The different paragraphs present in Section 0 indicate how 7 „ can 
simulate a double splicing of the simulated system starting with the code of 
two axioms of that system. More detailed information about this process can be 
found in 0. The two strings created at the end of the simulation of a double 
splicing have the same syntax used to code axioms of the simulated system. So 
they can be used to simulate subsequent splicings. 

Now we are going to demonstrate that the only terminal strings created by 
7 „ are the ones coding strings generated by the simulated system. 

Let us begin to say that none of the strings in A„ is terminal, so terminal 
strings can be present in the input-axioms and the input-rules or can be generated 
by splicing. 
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Our implementation of the algorithm with double splicing is made such that 
all rules used are in couples: the first one recognizes the pair (head, symbol) or 
(symbol, tail) in a string s and performs the first splicing with a short axiom 
belonging to Au fully indicated in the splicing site of the used rule. In this way 
one end of s is changed and a new short string is created. This short string is 
particular as created by a specific pair (head, symbol) or (symbol, tail) of s and 
the axiom used in the first splicing. The second splicing rule of a couple has in 
one of its splicing site an essential part of the new created short string. 

In this way it is possible to divide the set of rules of the universal system 
in two parts: the first composed by rules used for the first splicings, recognizing 
a pair (head, symbol) or (symbol, tail) present in a generic string s and using 
an axiom of 7 „, and the second composed by rules using the new short string 
generated in the previous step and the string s partially modified. 

So no rule of the second set may be used as first because the short string 
they recognize cannot be present. 

In this way we demonstrated that the evolution of a string is deterministic 
and it is driven by the pair (head, symbol) or (symbol, tail) present in a generic 
string s. If now we consider that only strings present in the input-axioms can 
contain the pairs (head, symbol) or (symbol, tail) priming the deterministic 
process, we end up to the conclusion that final strings can be obtained only 
starting from strings present in the input-axioms. 

Short strings generated by a double splicing can be involved in another one 
as they can contain a pair (head, symbol) or (symbol, tail). But the operation 
will generate other non terminal short strings never involved in the deterministic 
process just described. 

It is possible to reduce to one the number of axioms present in the universal 
system. 

Theorem 2 There exists a universal system for EHSds, which is of type 
i[l],FIN). 

Proof: Consider = (I 4 , T„, A„, i?„) the system defined in the previous 
section, and 7 = (U, T, A, R) element of the class of EHSds. We write = 
{Vf,Tu,A'^,R'J of type ([l],F/fV), so that = Hi) V 7 in EHSds. 

If we consider Vf = U {z}, where z ^ 14, it is possible to write the single 
element of A'^ as za\za 2 Z ■ ■ ■ apZ where AuACa{A)ACr{R) = {<^ 1 , 0 : 2 , ■ ■ ■ , ctp}- 
Moreover = i?„ U {#z$z#}. 

Applying twice the rule ffz%zff to two occurrences of the axiom it is possible 
to obtain one element of A„. Repeating this process all elements of A„ can be 
obtained so that can elaborate as 7 „. 

The rule #z$z# cannot be used on elements of A„ as z ^ 14. On the other 
hand if splicing rules in R^ are applied to strings containing z they will not 
generate terminal strings as z ^ and this proves the theorem. 

The number of splicing rules present in 7 ^ is 806, as listed in the Appendix of 
( 3 , and considering the repetition of rules to perform the second match and the 
second splicing. Even if still big, the number of rules is smaller than the number 
for a indirect universal system. If such a system is created using a universal 
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Turing machine, the minimal number of rules would be 222.220 (as indicated in 
Chapter 8 of lEl). 

The complexity of the simulation of a simulated double splicing step can be 
estimated as follows. Let us consider that X\,X 2 S (VFU{&})* are the information 
of the two strings involved in the simulation of the first splicing, and that | x\ |= 
p, I X 2 1= q- Moreover let yi,y 2 G W* be the code of the two splicing sites of 
the rule simulated on X\ and X 2 , \ yi \= x and | y 2 |= s. The symbol t and 
u are related to the number of symbols rotated during the ordering operation 
(paragraph 1.8) for the first splicing. The symbols p', g', r', s', t' and u' have the 
same meaning of p, g, r, s, t and u respectively for the second splicing. 

The universal system will need 

10(p+g+p'+g')+4(rp+sg+r'p' + s'g')+t(4+4p)+M(4+4g)+t'(4+4p')+u'(4+4g')+48 
splicing operations to simulate a double splicing. The detailed explanation of the 
previous expression is present in [Z]- 

6 Final Remarks 

As expected the number of rules present in a direct universal H system is smaller 
than the one needed by an indirect one. The result described here is still far from 
a possible implementation in a biological laboratory, but we hope that this paper 
stimulates possible improvements of the algorithm. For instance, it is possible 
to consider to implement direct universal H systems using a control systems 
different from double splicing, or, for instance, it could be interesting to use 
circular strings. This last consideration comes from the fact that rotation, basic 
operation in our algorithm, is automatic in circular strings. 

Moreover our algorithm is strongly linked to the idea of a sequential state 
machine: the pairs (head, symbol) or (symbol, tail) are really close to the way 
of working of Turing machines. Maybe it is possible to base a universal system 
on a totally different idea. 

Considering that all extended H system with a control system generate the 
class or RE languages it is easy to imagine how to translate one system to another 
one through a Turing machine. The idea to simulate splicing using splicing that 
we described can be use to create direct translation between extended H systems 
with different control systems. 
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Abstract. There are two simple models of cellular automata: a semi- 
inhnite array (with left boundary) of cells with sequential input mode, 
called an iterative array (lA), and a finite array (delimited at both 
ends) of n cells with parallel input mode, called a bounded cellular ar- 
ray (BCA). This paper presents a quadratic speedup theorem for lAs and 
an exponential speedup theorem for BCAs by using alternations. It is 
shown that for any computable functions s{n),t{n) > n, every s{n)t{n)- 
time deterministic lA can be simulated by an 0(s(n))-space 0{t{n))- 
time alternating lA. Since any t(n)-time lA is t(n)-space bounded, ev- 
ery (t(n))^-time deterministic lA can be simulated by an 0(t(n))-time 
alternating lA. This leads to a separation result: There is a language 
which can be accepted by an alternating lA in 0{t{n)) time but not by 
any deterministic lA in 0{t(n)) time. It is also shown that every t(n)~ 
time nondeterministic BCA can be simulated by a linear-time alternating 
BCA. 

1 Introduction 

One of the simplest models for parallel recognition of languages is the cellu- 
lar automaton (CA). A CA is a one-dimensional array of identical finite-state 
automata, called cells, which are uniformly interconnected. Every cell operates 
synchronously at discrete time steps and changes its state depending on the 
previous states of itself and its neighbors. There are two simple models of CA: 
a semi-infinite array (with left boundary) of cells with sequential input mode, 
called an iterative array (lA), and a finite array (delimited at both ends) of 
n cells with parallel input mode, called a bounded cellular array (BCA). 

There is a huge amount of literature on simulation and separation results 
between various types of CA. For example, Umeo et al. uni showed that every 
real-time two-way BCA can be simulated by a linear-time one-way BCA (in 
which information is allowed to move one direction). Dyer jSj proved that every 
nondeterministic two-way BCA can be simulated by a nondeterministic one-way 
BCA. Ibarra et al. showed in jS| that linear-time one-way BCAs are equivalent 
to 2n-time one-way lAs. Iwamoto et al. presented a time hierarchy theorem 
for lAs m, but it is open whether a similar time hierarchy theorem holds for 
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BCAs. (More information on relationships among various models may be found 
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In this paper, we investigate the relationship between alternating and 
(non)deterministic CA. We first present a quadratic speedup theorem for lAs. It 
is shown that for any computable functions s(n),t{n) > n, every s(n)t(n)-time 
deterministic lA can be simulated by an 0(s(n))-space 0(t(n))-time alternating 
lA. Since any t(n)-time lA is t(n)-space bounded, every (<(n))^-time determin- 
istic lA can be simulated by an 0(t(n) )-time alternating lA. It is known that 
if ti{n) and t 2 {n) are computable functions such that lim„_,.oo = 0> then 
there is a language which can be accepted by a deterministic lA in t 2 {n) time 
but not by any deterministic lA in ti(n) time El. Thus, our speedup theorem 
leads to a separation result: There is a language which can be accepted by an 
alternating lA in 0{t(n)) time but not by any deterministic lA in 0(t(n)) time. 

We also present an exponential speedup theorem for the fixed-space model. 
It is shown that for any computable function t(n) > n, every t(n)-time non- 
deterministic BCA can be simulated by a linear-time alternating BCA. (Note 
that tin) = because the number of configurations of any BCA is bounded 
by 

Computations on BCAs can be sped-up exponentially from time to 

0(n) time, while lAs can be sped-up quadratically. There is an essential differ- 
ence between BCAs and lAs. Since an lA consists of an infinite array of cells, a 
t(n)-time lA can use t{n) cellular space. Therefore, in order to speedup a (t(n))^- 
time computation on an lA to t{n) time, we must also compress cellular space 
from {t{n)Y to t{n). Our theorem for lAs is also a space-compression result. 

The first speedup result for CA was shown by Smith he showed that 
t(n)-time BCAs can be sped-up to n -|- t{n)/k time, where fc is a constant. 
A linear speedup theorem for a synchronization problem was presented in 0. 
Buchholz PP considered arrays with restricted nondeterminism; it was shown that 
every linear-time computation on one-way one-guess BCAs can be sped-up to 
real-time. Ozhigov El investigated the trade-offs between the time and space 
complexities of computations on nondeterministic CA of dimension r. It was 
shown that a computation with time complexity t{n) and space complexity s(n) 
can be simulated with time and space complexity 0((t(n))^/*^’'+^^(s(n))’'/(’'“''^^). 

Simulation results between alternating CA and alternating Turing ma- 
chines (TMs) can be found in m and m- Krithivasan et al. El proved that 
alternating BCAs have the same power as linear-space alternating TMs. Mata- 
mala m proved that polynomial-time alternating CA have the same power as 
polynomial-time alternating TMs, and real-time alternating CA are strictly more 
powerful than real-time alternating TMs. 

For TMs, several speedup results have been known. Wiedermann m showed 
that NTIMEi(t(n)) C A 2 -TIMEi(t(n)/logt(n)), where A 2 -TIMEi(t(n)) is the 
set of languages accepted by t(n)-time 1-tape TMs with one alternation from ex- 
istential to universal. For multitape TMs, Gupta jOl proved that DTIME(t(n)) C 
A 2 -TIME(t(n)/log*<(n)). 
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Fig. 2. Bounded cellular array 



In the following section, we give the definition of CA. Main theorems are 
summarized in Section 0 The proofs are given in Sections 0 and 0 



2 Cellular Automata 

A cellular automaton (CA) is a synchronous highly parallel string acceptor, 
consisting of a one-dimensional array of identical finite-state automata, called 
cells, which are uniformly interconnected. Every cell operates synchronously at 
discrete time steps and changes its state depending on the previous states of itself 
and its neighbors. There are two simple models of CA: a semi-infinite array (with 
left boundary) of cells with sequential input mode, called an iterative array (lA) 
(see Fig.0), and a finite array (delimited at both ends) of n cells with parallel 
input mode, called a bounded cellular array (BCA) (see Fig. EJ. 

A nondeterministic lA is a 6-tuple I = {Q,Qa, S,^,6,q), where 

(1) Q is the finite nonempty set of cell states, 

(2) Qa is the accepting subset of Q, 

(3) S is the finite input alphabet not in Q, 

(4) # is the special boundary symbol not in A U Q, 

(5) S : QLI S LI {#} x Q x Q — >■ 2*^ is the local transition function, 

(6) q G Q is the quiescent state such that 6{q, q, q) = {g} and if 5{a, b,c) B q 
then b = q. 

The cell assigned to the integer i > 1 is denoted by c^. At step t = 0, the state of 
each cell is the quiescent state q. The input string 0102 • • • Un, where ai G A, is 
fed serially to the leftmost cell Ci; the symbol a^, 1 < z < n, is the left neighbor 
of the cell c\ at step i — 1. After step n — 1, the left neighbor of c\ is the boundary 
symbol #. A configuration of an lA is represented by a string in S*{Q — {g})*. 

A nondeterministic BCA is a 5-tuple B = {Q, Qa, where 

(1) Q is the finite nonempty set of cell states, 

(2) Qa is the accepting subset of Q, 

(3') A is the finite input alphabet in Q, 

(4') # is the special boundary state not in Q, 

(5') (5 : Q U {#} X Q X Q U {#} — >■ 2^ is the local transition function. 
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The input string 0102 • • • a„ is applied to the array in parallel at step 0 by setting 
the states of cells Ci, C2, . . . , c„ to Oi, 02, . • . , a„, respectively. A configuration of 
a BCA is represented by a string in Q”. 

The definition of a computation tree is mostly from m- Computations of a 
nondeterministic CA given some input are described as a tree T : All nodes are 
configurations, the root is the initial configuration of the CA for the given input, 
and the children of a configuration C are exactly those configurations which can 
be reached from C in one step allowed by the transition function. Leaves of the 
tree T are final configurations, they may be accepting or rejecting. Certain paths 
in T may be infinite. 

An interior node of the tree T is defined to be accepting if at least one of 
its children is accepting; the CA accepts the input iff the root is accepting. 
A nondeterministic CA is said to be t(ri)-time hounded if, for every accepted 
input of length n, there is an accepting node whose distance from the root is at 
most t{n). 

The states of an alternating CA are partitioned into four classes: accepting 
states, rejecting states, universal states, and existential states. Whether a par- 
ticular configuration is a universal, an existential, an accepting, or a rejecting 
configuration is determined by the state of the leftmost cell ci. (This model is 
called a week alternating CA in [TT?].l 

An existential interior node is defined to be accepting if at least one of its 
children is accepting. A universal interior node is defined to be accepting if all its 
children are accepting. An alternating CA M is defined to be t{n)-time bounded 
if, for every accepted input w of length n, the computation tree T oi M started 
with w stays accepting if it is pruned at depth t{n). A subtree of T is called a 
rejecting subtree if its root is not accepting. An alternating CA M is defined to 
be s{n)-space hounded if, for every accepted input w of length n, every node of 
the subgraph obtained from T by removing all rejecting subtrees is represented 
by a string of length at most s(n). 

Clearly, alternating CA without universal states are just nondeterministic 
CA. Nondeterministic CA of which each node of the computation tree has at 
most one child are deterministic CA. 

3 Speedup Theorems 

It is well known that linear-time speedup and linear-space compression theorems 
hold for CA this paper, we present a quadratic speedup theorem for lAs 

and an exponential speedup theorem for BCAs by using alternations. Let hin{n) 
denote the binary string (of length [log n] ) which represents the value n. 

Theorem 1. Suppose that s{n) > n and t(n) > n are eomputahle functions such 
that there is an 0{s{n))-space 0{t{n))-time TM which, given bin{n), generates 
both bin{s{n)) and bin{t{n)). For any s{n)t{n)-time deterministic lA M, there 
exists an 0{s{n)) -space 0{t{n))-time alternating lA which can simulate M. 

The proof of this theorem is given in Section 0 (The definition of com- 
putable functions are from HH. Note that the set of computable functions t(n) 
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and s(n) includes all functions computable by polynomial-time TMs, since the 
input length of TMs is [logn].) Since s(n)t(n)-time lAs are obviously s{n)t{n)- 
space bounded, “s(n)t(n)-space” is omitted in the statement of Theorem ^ If 
we omit “0(s(n))-space” from Theorem 1, we obtain the following corollary. 

Corollary 1. Suppose that t{n) > n is a computable function such that there is 
an 0{t{n))-time TM which, given bin{n), generates bin(t{n)). For any (t(n))^- 
time deterministic lA M , there exists an 0(t{n))-time alternating lA which can 
simulate M . 

It is known that if ti(n) and t 2 {n) are computable functions such that 
lim„_>oo = 0, then there is a language which can be accepted by a de- 
terministic lA in t 2 {n) time but not by any deterministic lA in ti(n) time jl Ij . 
Therefore: 

Corollary 2. Suppose that t(n) > n is a computable function such that there is 
an 0{t(n))-time TM which, given hin(n), generates bin(t(n)). Then, there exists 
a language which can be accepted by an alternating lA in 0(t(n)) time but not 
by any deterministic lA in 0{t(n)) time. 

We turn our attention to time complexity of the fixed-space model. 

Theorem 2. Suppose that t{n) > n is a computable function such that there is 
an 0{n)-time TM which, given bin{n), generates &m(|"logt(n)]). For any t{n)~ 
time nondeterministic BCA M , there exists a linear-time alternating BCA which 
can simulate M . 

The proof is given in SectionEl Note that the number of configurations of any 
BCA is bounded by Theorem |3^ii) implies that nondeterministic BCAs 

can be sped-up exponentially. 

4 Quadratic Speedup Algorithm for lAs 

In this section, we prove Theorem ^ Let Mjy be an arbitrary s(n)t(n)-time 
deterministic lA. We construct an 0(s(n))-space 0(f(n))-time alternating lA, 
say, Ma, which can simulate Mjy. A function f{n) is said to be constructible if, 
for each n, there is an lA whose accepting cell ci enters an accepting state at 
step f{n) on all inputs of length n. It is known that all functions computable by 
TMs are constructible by lAs m- Since s{n) and t(n) are computable functions, 
we can assume they are also constructible. 

Before describing details, we observe some basic ideas of the proof using a 
simple example. Consider the ith cell of Mjj (see Fig. E)- The state of the zth cell 
at step r is determined by the states of 2s(n) -|- 1 cells at step r — s(n). In order 
to describe such a situation, the array of Ma’s cells is divided into tracks (see 
Fig.^. The second track contains the state of the Ah cell of Mjj at step r. In 
order to simulate an s(n)-step computation ol Md, Ma existentially generates 
(guesses) 2s(n) -I- 1 states in the first track. Then, using a universal branch (see 
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Fig. 4. Tracks of alternating lA Ma 



Fig. El;b)), Ma verifies whether the generated 2s(n) -|- 1 states are consistent 
with the state of the ith cell oi Mo at step r according to the transition func- 
tion of Mo- Also, Ma goes up to the next level (see FigEfa)); Ma universally 
chooses one cell in order to start the simulation of the next level (see the cell 
having state 7 in Fig. 0 . Then, Ma stores the state 7 of the chosen cell into the 
second track in 0(1) steps. Furthermore, Ma computes the values t — s{n)^i + j 
and stores them into the third track in 0(1) steps. (These 0(l)-step procedures 
require some techniques, which will be described later.) Ma repeats this algo- 
rithm t{n) times. Finally, Ma verifies whether the first track contains a part 
of the initial configuration of Mo (see track G in Fig. 0 . In the following, we 
describe the algorithm in detail. 

(1) At step 1, Ma performs three procedures (1.1), (1.2) and (1.3) in parallel 

using three tracks. (1.1) Ma stores the input string 0102 into n cells. 

Cl, C 2 , • • • , c„, where a„# is stored into c„ as a single symbol a*. This procedure 
needs only 2n steps. (1.2) Ma simulates an lA, say, M 4 S+ 2 , which makes exactly 
4s(n) -I- 2 moves. (1.3) Ma sends a unit-speed pulse from ci. On the pulse, Ma 
existentially generates (guesses) a string of the form 00 • • • 01 of length 2s{n) -1-1. 

(2) Ma universally performs procedures (2.1) and (2.2) simultaneously. 

(2.1) Ma verifies whether the string 00 • • • 01 has length 2s(n) -I- 1 as fol- 
lows. The cell containing “right-boundary” symbol 1 sends a unit-speed pulse 
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Fig. 5. s(n)-step computation of Md and simulation of Md by Ma 



toward c\. If the pulse reaches C\ and M 4 S +2 enters an accepting state at the 
same time, then Ma knows that the length of 00 • • • 01 is exactly 2s(n) + 1, and 
Ma halts with an accepting state; otherwise, Ma halts with a rejecting state. 

(2.2) Ma assumes that the region filled by the string 00 • • • 01 has 
length 2s{n) + 1. In the following, Ma does not violate the space limita- 
tion 2s(n) -I- 1. The array of Ma’s cells is divided into tracks. In a track, say. A, 
Ma copies the input string 0102 ••• a* as many as possible in order that any 
2n — 1 contiguous cells contain the input string (see Fig. E). This can be done 
in 0(s(n)) steps. 

Ma uses tracks G, S, T, /, and C. (G, S, T, I, and G stand for Guessed states, 
current State, step t, cell i, and Counter. Three tracks in Fig. 0| correspond to 
track G, track S, and tracks T,I.) Each of tracks T and I is further divided 
into subtracks. Let ai,gi,Si be subcells of cell Ci in tracks A,G,S, respectively. 
We denote the state of the ith cell of Mo at step r by state(T,i). Note that 
state(T, 0) = Or for 0 < r < n— 1, and state(r, 0) = ^ for t > n. For convenience, 
we assume state(r,i) = ^ for j < 0. In order that all cells of Ma start the 
following procedure (3) simultaneously, Ma uses the firing squad synchronization 
algorithm m- 

(3) Without loss of generality, we can assume that Mo has the unique ac- 
cepting state qa- Initially, track S contains a string QaQa • • ■ 9a of length 2s(n)-|-l. 
Note that all subcells (including the center subcell) of track S contain the same 
state Qa- Tracks T and I contain values r = s(n)t(n) and i = 1, respectively. It 
should be noted that t{n) > s{n), and the values of computable functions s(n) 
and t(n) are represented by binary strings of length [log t(n)~\ . Thus, the value 
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of s{n)t{n) can be computed in time polynomial in logt(n). Tracks T,I,S rep- 
resent that the leftmost cell of Mjj has state Qa at step r = s{n)t{n) (i.e., 
state{r, 1) = qa). 

Note that state{T, i) depends on 2s(n) -I- 1 states at step r — s{n) as follows: 

state{T — s{n),i — s(n)), . . . , state(r — s(n),i), . . . , state{T — s{n),i + s(n)) 

Track G contains these 2s{n) + 1 states, which are guessed existentially. Note 
that these states can be generated in parallel within 0(1) steps. Furthermore, 
in a subtrack of T, existentially guesses the new value t = t — s{n) as a 
binary string, which can also be generated in 0(1) steps. 

(4) Ma performs the following procedures (4.1), (4.2), and (4.3) universally. 

(4.1) Ma verifies whether the guessed value r is equal to r — s(n). Since r 
and s(n) can be represented by binary strings of length 0(logt(n)), the value 
of r — s(n) can be computed in time polynomial in log t{n). If r = r — s(n), then 
Ma halts with an accepting state; otherwise Ma halts with a rejecting state. 

(4.2) Ma verifies whether the above 2s(n) -I- 1 states are consistent with the 
state in track S by making an s(n)-step simulation of Md. If they are consistent 
with the state in track S according to the transition function oi Md, then Ma 
halts with an accepting state; otherwise Ma halts with a rejecting state. 

(4.3) Ma assumes that r = r — s{n) and the 2s(n) + 1 states are consistent 

with the state in track S. Ma universally chooses one of the 2s{n) + 1 cells of 
track G as follows. (The state, say, q' , of the chosen cell will be the new state 
in track S. See the state 7 in Fig. EKb)) Ma universally generates a value j 
(where — s(n) < j < s(?^)) as a binary string in a subtrack of I within 0(1) 
steps. Then, Ma existentially generates a string of the form of 

length 2s{n) + 1, where —s{n) < j < s{n). The following (4.3.1) and (4.3.2) are 
universally performed. 

(4.3.1) Ma verifies whether (i) exactly one 1 appears in the generated string 
of length 2s{n) + 1 and (ii) this 1 appears at the (s(n) -I- j + l)st position (i.e., 
j = j). (i) can be verified by a single scan. In order to verify (ii), the symbol 1 
sends a unit-speed pulse toward the leftmost cell, and Ma decreases the value one 
by one from s{n) -|- j -I- 1 to 0 (using the counting algorithm in ^3), doing both 
simultaneously. If the pulse reaches the leftmost cell and the value becomes 0 
at the same time, Ma halts with an accepting state; otherwise, Ma halts with 
a rejecting state. (Note that the value of s{n) -I- j + 1 can be computed in time 
polynomial in logt(n).) 

(4.3.2) Ma stores the state q' of the chosen cell into a subcell of track S. In 
order to generate a string q'q' ■ ■ ■ q' of length 2s(n)-|-l, Ma existentially generates 
(guesses) an arbitrary string cr S Qd* of length 2s{n) + 1, where Qd is the state 
set of Md- Ma performs (4.3.2a) and (4.3.2b) universally. 

(4.3.2a) Every subcell of track S verifies (locally) whether the generated 
string is over one-letter alphabet, and the subcell at the (s(n) +j+ l)st position 
verifies whether the letter is q' . Ma halts with a rejecting state if cr q'q' ■ ■ ■ q'; 
otherwise, Ma halts with an accepting state. (The rejecting pulse reaches the 
leftmost cell in 0(s(n)) steps.) 
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Fig. 6. Tracks A and G 

(4.3.2b) Ma assumes that a = q'q' ■■■q', t = t — s{n), and j is correct. 
Ma regards t and z + j as r and i, respectively. As in (3), in track G, Ma 
existentially generates 2s(n) + 1 states in 0(1) steps. Then, in a subtrack of T, 
Ma existentially generates the new value t = t — s{n) in 0(1) steps. Again, Ma 
performs the procedure (4). 

During the above procedures, Ma simulates an lA which makes exactly t(n) 
moves in track C. (Recall that t{n) is a constructible function.) At the t{n)th 
recursive call of the procedure (4), Ma verifies whether 2s{n) + l states in track G 
are consistent with initial configuration of Mo (see track G in Fig. EJ, i.e., 

, state{0, —1) = #, state{0, 0) = oi, state{0, 1) = q, state{0, 2 ) = q, , 

where Oi G A is the first symbol of the input string, # is the boundary symbol, 
and q is the quiescent state. This verification is described as follows: First of all, 
Ma verifies whether the sequence of 2s{n) + 1 states in track G is a string in set 
{#}* U U {g}*. This can be verified by a single scan. Suppose that 

a subcell, say, gi, of track G contains ai € S. 

Let k be the smallest integer such that I < k and subcell in track A 
contains ai. In order to make an s(n)-step simulation of Mo, the subcell gi 
needs the input string 0402 ■ ■ ■ a*. Thus, Ma must shift the sequence in track A 
k — I positions to the left as follows. 

The leftmost subcells gi and ai send two pulses Pg and Pa, respectively. 
When the pulse Pg (resp. Pa) reaches gi (ofe), the subcell gi (a^) sends a pulse 
P'g (Pa) toward g\ (oi). Therefore, at steps 21 and 2k, these pulses reach gi 
and ai, respectively. Ma starts the firing squad synchronization algorithm at 
steps 21 and 2k. During the time period from the first synchronization to the 
second synchronization, Ma can shifts the sequence in track A k — I positions 
to the left. Also, Ma verifies whether the distance from gi to the center of the 
2s{n) + 1 cells is equal to the value i in track I. This can be verified by using 
a unit-speed pulse from gi toward the center cell and the counting algorithm 
in HU- The time complexity of this procedure is bounded by 0(s(n)). 

The procedure in the previous paragraph and procedures (1.1), (1-2), (2.1), 
(4.1), (4.2), (4.3.1) and (4.3.2a) are “leaves” of the computation tree of Ma- All 
these procedures are performed in at most 0{t{n)) steps. (Note that s(n) < t(n).) 
Procedures (1.3), (2.2) and (3) are performed once; their time complexities are 
0(t(n)). Each of the remaining procedures can be executed in 0(1) steps. Proce- 
dure (4) is executed t(n) times. Therefore, the time complexity of Ma is bounded 
by 0(t(n)). Since each execution of (4) simulates an s(n)-step computation 



Speeding-Up Cellular Automata by Alternations 249 



of Mo, alternating lA Ma can simulate an s{n)t{n)-step computation of Mo- 
As we mentioned in (2.2), the space complexity of Ma is bounded by 0{s{n)). 
This completes the proof of Theorem 2] 



5 Exponential Speedup Algorithm for BCAs 

In this section, we prove Theorem |21 Let Mjy be an arbitrary t(n)-time nonde- 
terministic BCA. We construct a linear-time alternating BCA, say, Ma, which 
can simulate M^. It should be noted that the value t{n) is bounded by 
(and thus logt(n) = 0{n)), since the number of configurations of any BCA is 
bounded by 2'=’(”). 

First of all, we give a sketch of the proof. Ma verifies whether there is a 
t(n)-step computation path of Mpf from the initial configuration to a final con- 
figuration as follows. Ma existentially generates the configuration of Mjv at 
step t{n)j2. Then, using a universal branch, Ma verifies whether both the first 
half computation and the second half computation are correct. Ma repeats this 
procedure [logt(n)] times recursively (see below for the details). In order to 
execute this algorithm, Ma must compute the value of |"logt(n)] in advance. 
Furthermore, computing the value [logt(n)] requires the value of n. For these 
reasons, Ma performs the procedure given in the following two paragraphs. 

(1) Using the same technique in [I I Ij . Ma generates binary string bin(n), 
which represents the value n. This can be done in 0{n) steps. Ma computes the 
value of [logt(n)] using the generated value n; Ma simulates a TM which, given 
bin{n), generates bin{\logt{n)~\) in time 0{n) and space 0{n). Note that the 
length of the input of the TM is [log n] . Using the linear-speedup theorem ini, 
this simulation can be done in n steps and n cells. 

(2) Ma uses tracks, say, U, V, W and X. For 1 < i < n, let Ui,Vi, Wi and Xi be 
subcells of Ci in tracks U, V, W and X, respectively. We denote the configuration 
of Mjv at step r by C'(r). We consider three integers u,v,w satisfying u = 0, 
V = \_{u + w)/2\, and w = t{n) . (Note that Ma does not have to compute these 
values.) 

Initially, track U contains the input string (i.e., initial configuration C(0) 
of Mat). Subcell wi contains an accepting state qa of Mn and the remaining 
subcells W 2 , ■ ■ ■ ,Wn contains n — 1 states, which are guessed existentially. These 
n states can be generated in one step in parallel. (Subcells Wi, W 2 , • ■ • , Wn corre- 
spond to an accepting configuration C{t{n)).) Also, subcells V\,V 2 , • ■ ■ con- 
tains n states which are guessed existentially. (These states correspond to C(v).) 

(3) The leftmost cell universally generates a symbol s S {0, 1}. Then, Ma 
existentially generates a string a G {0, 1}* of length n in track X. Ma performs 
the following procedures (3.1) and (3.2) universally. 

(3.1) Ma verifies whether a is over one-letter alphabet, and the leftmost cell 
verifies whether the letter is s. Ma halts with a rejecting state if cr ss ••• s; 
otherwise Ma halts with an accepting state. (The rejecting pulse reaches the 
leftmost cell in n steps.) 
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(3.2) Under the assumption a = ss ■ ■ ■ s, Ma stores the state of every sub- 
cell Vi into Wi (resp. Ui) if Xi contains s = 0 (resp. s = 1). (Consider updated 
integers u := u, w := v, and v := [(m -I- w)/2\ (resp. u := v, w := w, and 
V := [(m -I- w)/2\).) Ma existentially generates n states in parallel in one step, 
which are stored into subcells v\,V 2 t ■ ■ ,Vn- (These states correspond to config- 
uration C{v)). Again, Ma performs the procedure (3). 

During the above procedures, Ma counts the number of calls of (3) by de- 
creasing the value one by one from |"logt(n)] to 0. When the value becomes 0, 
Ma changes the configuration in track U into a one-step-after configuration ac- 
cording to the transition function of If the one-step-after configuration is 
equal to the configuration in track W, then Ma halts with an accepting state; 
otherwise Ma halts a rejecting state. (The accepting or rejecting pulse reaches 
the leftmost cell in n steps.) 

The procedure in the previous paragraph and procedure (3.1) are “leaves” of 
the computation tree of Ma, which can be done in 0(n) steps. Procedures (1) 
and (2) can be performed once; the time complexity of them is 0{n). Each of the 
remaining procedures can be performed in 0(1) steps. Procedure (3) is executed 
l’logt(n)] times. Since logt(n) = 0(n), the time complexity of Ma is bounded 
by 0{n). This completes the proof of Theorem 0 
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Abstract. In order to obtain universal classical cellular automata in- 
finite space is required. Therefore, the number of required processors 
depends on the length of input data and, additionally, may increase dur- 
ing the computation. On the other hand, Turing machines are universal 
devices which have one processor only and additionally an infinite storage 
tape. Here an in some sense intermediate model is studied. The push- 
down cellular automata are a stack augmented generalization of classical 
cellular automata. They form a massively parallel universal model where 
the number of processors is bounded by the length of input data. 
Efficient universal pushdown cellular automata and their efficiently ver- 
ifiable encodings are proposed. They are applied to computational com- 
plexity, and tight time and stack-space hierarchies are shown. 



1 Introduction 

Arrays of automata can be understood as models for massively parallel com- 
puters. By treating them as acceptors for formal languages their computational 
power can be compared with other parallel and sequential computer models. 
Under these aspects the automata arrays and various modifications have been 
studied for a long time. Especially investigations concerning universality (of- 
ten combined with other properties) have been done e.g. in |1I5I11I12I14I15| . 
A state-of-the-art survey on universality and decidability versus undecidability 
in cellular automata and several other models of discrete computations can be 
found in cni. 

Due to the historical precedent for a fixed amount of memory per cell (un- 
bounded) cellular automata have to be defined over an infinite space in order to 
obtain computational universality. Therefore, the number of required processors 
depends on the length of input data and, additionally, may increase during the 
computation. From a more practical point of view an infinite number of pro- 
cessors seems to be fairly unrealistic. On the other hand Turing acceptors are 
computationally universal devices which have one processor only and addition- 
ally an infinite storage tape. For this reason and due to the possible speed-up 
gained in parallelism we investigate the pushdown cellular automata (PDCA) 
where each cell is now a deterministic pushdown automaton rzE]- So we obtain 
a computationally universal computer model where the number of processors is 

M. Margenstern and Y. Rogozhin (Eds.): MCU 2001, LNCS 2055, pp. 252-|2^^ 2001. 

(c) Springer- Verlag Berlin Heidelberg 2001 
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bounded by the length of input data. Furthermore, in our opinion the assumption 
of arbitrarily large pushdown memory is less problematical than the assumption 
of an arbitrarily number of processors. 

Clearly, a PDCA with at least two cells is sufficient in order to obtain an 
universal device. But with an eye towards applications e.g. in diagonalization 
proofs here we are interested in efficient universal PDCAs. 



2 Basic Notions 

We denote the integers by Z, the positive integers {1,2, ...} by N and the set 
N U {0} by Nq. The empty word is denoted by A and the reversal of a word w 
by w^. For the length of w we write |w|. We use C for inclusions and C if the 
inclusion is strict. proj^{x\ ■ ■ ■ Xn) = Xi selects the Ah component of a word or a 
vector. 

A pushdown cellular automaton is a linear array of identical deterministic 
pushdown automata, sometimes called cells, where each of them is connected to 
its both nearest neighbors (one to the right and one to the left). For convenience 
we identify the cells by positive integers. They operate synchronously at discrete 
time steps. The state transition of a cell depends on the current states of its 
both neighbors, the current state of the cell itself and the current symbol at the 
top of its stack. With an eye towards language recognition we provide accepting 
and rejecting states. More formally: 

Definition 1. A pushdown cellular automaton (PDCA) is a system 
(S,G,Ss,Sp,#,-L,A,F+,F_), where 

1. S is the finite, nonempty set of states, 

2. G is the finite, nonempty set of stack symbols, 

3. # ^ S is the boundary state, 

4- -L G G is the bottom-of-stack symbol, 

5. A C S is the finite, nonempty set of input symbols, 

6. F+ and F_, F^ fl F_ = 0, are the sets of accepting and rejecting states, 
respectively, 

7. (5s : (S' U 1#})^ X G — >■ S is the local state transition function, 

8. (5p : (S U 1#})^ X G — )> {A} U G U G^ is the local stack transition function 
satisfying Vsi, S2, S3 G S, g G G \ |T} : 

^p(si,S 2 ,S 3 ,T) e {g'l. I g' G (G\ |T}) U {A}} and 
Sp{suS2, S3,g) G {A} U (G \ |T}) U (G \ {T})^. 

The condition on the local stack transition function ensures that the bottom- 
of-stack symbol appears at each cell exactly once (i.e. at the bottom of its stack). 
At every transition step each cell consumes the symbol at the top of its stack 
(if it is not empty) and pushes at most two new symbols onto it. Note that the 
restriction of pushing at most two symbols at every time step neither reduces 
the computation power nor slows down the computation itself 0. 
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Let A4 = {S,G,6s,6p,#,-L,A,F+,F-) be a PDCA. A configuration of M 
at some time t > 0 is a mapping Ct : [1, . . . ,n] — >• S x G+ for n G N, that 
maps the cells to their current states and stack contents. The configuration at 
time 0 is defined by the input symbols and empty stacks. For a given input 
w = ai ■ ■ ■ Un G A+ we set co(z) = (ai,_L),l < i < n. Subsequent configurations 
are computed by synchronously applying both local transition functions to all 
cells in parallel. The virtual left neighbor of the leftmost cell and the virtual 
right neighbor of the rightmost cell are always assumed to be in the boundary 
state. 

3 Encoding of Pushdown Cellular Automata 

Needless to say, in general it is possible to encode PDCAs and their configu- 
rations with any nonempty alphabet. But with an eye towards applications in 
formal language recognition, we are interested in efficiently verifiable encodings 
that have to be chosen with respect to the processing universal PDCA. Later 
on, the encodings in combination with the universal PDCA will determine the 
tightness of the hierarchies of complexity classes. 

Let bin : Nq — >■ {0, 1}+ be the mapping that maps a natural number to its 
binary representation without leading zeroes. Then the binary representation 
with leading zeroes is for all fc > 1 defined by bin^ : No — >■ {0, n i— >■ 
0^“|bm(”)lbin(n) if A: > |bin(n)|. binfc(n) is undefined for k < |bin(n)|. 

S and G are finite nonempty sets and we can assume total orderings on 
their elements: S = {si, . . . , S|s|} and G = {gi, . . . , g\a\}- W.l.o.g. let sq denote 
the boundary state # and gi be the bottom-of-stack symbol _L. The different 
beginnings of the numberings have been chosen since for stack operations the 
empty word A has to be encoded in addition. The ordering of the state set implies 
orderings of F_|_ and F_. 

The state and stack transition functions can be represented as a table with 
seven columns. Each row contains the states of the left neighbor, of the cell itself 
and of the right neighbor and the symbol at the top of the stack, followed by 
the next state and the two symbols (which may be A) that form the new top of 
the stack: 

Si Sj Sk gi Ss{si,Sj,Sk,gi) proji{Sp{si,Sj,Sk,gi)) proj 2 {Sp{si,Sj,Sk,gi)) 

We assume that the rows are in lexicographic order. If for certain Sj, Sj, Sk, gi 
the corresponding row is missing then 6s{si, Sj, Sk, gi) is defined to be Sj and 
Sp{si,Sj,Sk,gi) is defined to be gi (i.e. neither the state nor the stack content 
changes) . 

Let k = |bin(max{|G|, |S'|})| and G = {0, 1, [, ] , +, -, b. A}. 

1. The states, stack symbols and the empty word are encoded by a pair of 
square brackets that contain the corresponding binary representation with 
leading zeroes followed by a mark (+, -,b) that indicates that the coded 
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symbol belongs to F^, F_ or not at all. Thus, we obtain a mapping as 
follows: 

codes : 5 U {#} U G U {A} ^ 

{ [binfc(i)+] if p = Si € S A Si € F+ 

[binfc(i)-] if p = Si e S A Si e F- 

[binfc(i)b] if p = Pi eGVp = Si eS \ {F+ U F_) 

[binfc(0)b] if p=AVp = # 

2. An ordered subset F = {/i, . . . , fm} C 5 is encoded by the concatenation 
of the codes of its elements: 

codes (F) = codes (fi) ’ • • codes (fm) 

3. A row of the transition table is now encoded by writing the codes of the 
symbols on 7 tracks which implies an alphabet G^ for the code: 

coder T Pn^ Po') — 

{proji{codes{si)),projj^{codes{sj)), . . . , proj j^{codes{po))) 



{projk+:i{codes{si)),proji^^^{codes{sj)), . . . ,proji^^^codes{po))) 

4. Consequently, the whole table with m rows is encoded as the 

concatenation of the codes of the rows: 

codes = coder (ri) ■ ■ ■ coder (vm) 

5. Hence, a PDCA M is encoded by the codes of the state S\s\ and the stack 
symbol p\Q\ on two tracks (which imply the number of states and stack 
symbols) followed by the codes of F_|_, of F_ and of the transition table. The 
whole is enclosed in square brackets on each track: 

code (M) = 

C {proj 1 {codes ( s | s| ) ) , proj ^ {codes (ff | G| ) ) ) 



{proj k+3(codes{s\s\)), proj ,,_^_3{codes{p\G\))) 

codeF{F+)codeF{F-)codes{S)'i ^ 

For easier reading we regard the encoding as a word over G^ (i.e. all the not 
used registers are filled with A) . 

Example 2. S = {si, S 2 , S 3 , S 4 }, G = {±,^ 2 }, F+ = { 52 }, A- = { 33 , 54 } and 
<5s(si, si. So, pi) = S 2 and <5p(si, si, sq, 51 ) = T define the only row of the transi- 
tion table. The encoding is as depicted in Figured 

It is not hard but a technical challenge to prove that the language {w G 
(G^)+ I 3PDCA A4 : w = code{A4)} of all PDCA encodings is recognizable by 
some PDCA in real-time. 
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Fig. 1. Example encoding of a PDCA. 



4 Encoding of Configurations 

By designing an efficiently verifiable encoding of PDCAs we have done the first 
step towards an efficient universal PDCA. Given the encoding of an arbitrary 
PDCA A4 and the encoding of an input word w a universal PDCA has to simulate 
the behavior of A4 on input w. Since in general w is independent of A4 at 
first it will be necessary to create the encoding of the initial configuration of 
At with respect to w. Since the universal PDCA has to handle PDCAs with 
arbitrarily large state sets the space requirement may be arbitrarily large during 
the simulation. Since the number of available cells is bounded by the length of 
the input the encodings have to be stored into the stacks. 

The following lemma solves a pattern transformation problem that will be 
utilized for the generation of initial configurations. 

Lemma 3. m Let A be an alphabet, h ^ A, w = gq ■ ■ ■ Un-i S A~^ and m S N. 
Then there exists a PDCA that transforms a configuration co{i) = Ui, 0 < i < 
n — 1, and co{i) = b, n < i < n + m, into the configuration ct{i) = Oimodn, 
0 < i < n + m, within t < 4(n + m) time steps. 
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Fig. 2. Example encoding of a cell. 

The encoding of a single cell in a configuration is an element from {b} x 
(((^5)fc)+ thus, well-suited for later processing. The second component is 
the stack content, which in turn consists of the codes of the current states of the 
cell itself and of its neighbors and the current stack content itself. 
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These codes are written on 5 tracks from top to bottom in reversed form 
(cf. Example 0). The fifth track is for later use and initially empty. So are the 
tracks 1, 2 and 3 lower down the stack since the codes for the states are stored 
only once. Let k = |codes(s)| be the length of the codes for states. For a cell i at 
time t let Ct(i- 1 ) = {sh,Ph,i ■ ■■Ph.m.h), Ct(i) = and Ct(i + 1 ) = 

{sj,Pj^i ■ ■ ■Pj,mj)- Then the encoding of cell i at time t is: 

codedi, t) = 
b, 

{projk{codes{sh)),projdcodes{si)),projdcodes{sj)),projdcodes{pz,i)),h) 

(projdcodes{sh)),projdcodes{si)),projdcodes{sj)),projdcodes{pi,i)),'b) 

(b, b, b, proj {codes fe. 2 ) ) , b) 

(b, b, b, proj 1 {codes{pi, 2 )), b) 

(b, b, b, proj {codes fe. 3 )) , b) 

(b, b, b, proj 1 {codes {Pi,rm )),b) 

Consequently, the encoding of a configuration ct is the concatenation of the 
encodings of the cells: 

codedct) = codec(l, t)codec{2, t) ■ ■ ■ codedn, t) 



Example j. S = {51,32,33,54}, G = {±,32}, T+ = { 52 }, = {33,54}. Let 

Ct{i- 1 ) = (52,52-L), Ct{i) = ( 53 , 3252 -L) and Ct{i+ 1 ) = (34, T) then codedht) 
is: 

b, 

(],],],],b)(+,-,-,b,b)(0,0,l,0,b)(l,l,0,l,b)(0,l,0,0,b)([, [, [, [,b) 

(b, b, b, ] , b)(b, b, b, b, b)(b, b, b, 0, b)(b, b, b, 1, b)(b, b, b, 0, b)(b, b, b, [, b) 

(b, b, b, ] , b)(b, b, b, b, b)(b, b, b, 0, b)(b, b, b, 0, b)(b, b, b, 0, b)(b, b, b, [, b) 

It will be stored in a single cell and its stack as depicted in Figure |21 

5 Universal Pushdown Cellular Automata 

This section is devoted to the construction of an efficient universal PDCA. The 
following first construction yields a PDCA U that, given the encoding of a PDCA 
M. and the encoding of a configuration Ct of At, computes the encoding of the 
successor configuration Ct+i of At within some r time steps. 

lA simulates one transition step of At in three phases. During the presimu- 
lation phase some of the tracks are initialized. The successor states and stack 
symbols of At are computed during the simulation phase. Subsequently, during 
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the postsimulation phase a valid configuration is computed such that the next 
step of M can be simulated. 

Presimulation phase: U has five main tracks which may be divided into sub- 
tracks. If in the sequel the lengths of some blocks do not divide the number of 
cells or if the construction uses a constant number of cells beyond the borders it 
is assumed that there exist another five main tracks that are used as the folded 
extensions of the corresponding tracks in order to cope with that (constant) 
space problem. 

The cell that contains the last symbol of code{Ai) can identify itself by the 
situation ]^]^. The same holds for the cell that contains the first symbol of 
codes{6) by the situation ] \J (cf. Figure Pi. 
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Fig. 3. An inner section after the presimulation phase. 

The cells that contain codedct) are divided into sections of length \codes{5)\ 
such that codes(S) can be made available on track 2 in each section. Each section 
is divided into blocks of length | codes (s)|, lets say of length k. The result of the 
presimulation phase is depicted in Figure 0 

By Lemma0the sections can be created by concatenations of codes{S). Dur- 
ing that process the inscriptions of track 4 and 5 can also be generated. 

At the end of the presimulation phase a global FSSP is started on the first 
track such that all the cells synchronously start the simulation phase. This takes 
2{\code{M)\ + |codec(ct)|) time steps. Thus, altogether the presimulation phase 
needs t < 4(|code(A4)|-|-|codec(ct)|)-|-2(|code(Ad)|-l-|codec(ct)|) = 6(|code(A4)|-|- 
|codec(ct)|) time steps. 

Simulation phase: In the sequel all signals and labellings are realized on the 
first main track (resp. its subtracks). The simulation is performed in all sections 
in parallel. Therefore, it suffices to explain the process for one section. 

On the fifth track modified FSSPs are performed. [ and ] are generals that 
synchronize the cells in between them every k (i.e. codes(s)) time steps. After 
each other synchronization the process is one time step delayed. 

During such a cycle all cells of codec{ct) with on their second track are 
working as follows (cf. Figure 0 for the general behavior). 

During each time step the top-of-stack symbol is copied onto five subtracks 
of the third track. At the same time the contents of the tracks 2 and 3 are shifted 



Efficient Universal Pushdown Cellular Automata 259 



to the right. Since on the second track there is the encoding of one of the rows 
of the transition table the cell can successively test whether the row matches its 
current situation. 

After k time steps the FSSP fires and the contents of the second and third 
track now are successively shifted to the left. Therefore, the cell can push its old 
state and top-of-stack symbol back into its stack if the row did not match the 
current situation, or the new state and top-of-stack symbol(s) otherwise. The 
test is finished when the FSSP fires again. During the delay step the contents of 
track 2 and 5 are shifted one cell to the right and the next cells will be tested 
during the next 2k time steps. 

The process has to be repeated until all cells of the section have been tested 
with all blocks of codes{S). This needs \codes{6) \ ■ {2k + 1) time steps. 

In order to recognize the end of the simulation phase on track 4 a FSSP 
is performed that synchronizes the whole section. One transition of the FSSP 
is computed at every time the block FSSP has finished a test cycle. Thus, the 
section is exactly synchronized after \codes{6) \ ■ {2k + 1) time steps. 




Fig. 4. A test cycle in the simulation phase. 
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Postsimulation phase: At the beginning of this phase the situation at the top 
of the stacks is as depicted in Figure 0 

The process that updates the stack contents is similar to the simulation 
phase. During k time steps the contents of the top of the stack of three adjacent 
cells are successively copied onto the third track. Then during another 2k time 
steps the stack content of the inner cell is updated appropriately. 

The postsimulation phase needs k such update cycles, hence, fc(3fc + 1) time 
steps. 




Fig. 5. Stack content at the beginning of the postsimulation phase. 

Theorem 5. Let n denote the length of the input of lA , then lA simulates I 
transitions of a PDCA A4 within t < 6n + 7l\codes{6)\^ time steps. 

Proof. The theorem follows from the fact that the presimulation phase has only 
to be performed once. Let k = |code 5 (s)|. From the construction we obtain: 

t < 6{\code{M)\ + \codec{ct)\) + l{\codes{5)\ ■ {2k + 1) + fc(3fc + 1)) 

< 6n + l{\codes{S) \ ■ 2k + \codes{S) \ + Sk^ + k) 

< 6n + l{2\codes{6)\‘^ + \codes{S)\ + S\codes{S)\^ + \codes{6)\) 

< 6n + l{5\codes{S)\'^ + 2\codes{5)\) 

<t!>n + 7l\codes{5)\^ □ 

The PDCA lA works fine if its input is the encoding of a PDCA and 
the encoding of a configuration of Ai. In the following this precondition is too 
restrictive since for diagonalization proofs we need to consider the behavior of 
PDCAs when they get their own encoding as input. 

The following construction yields a PDCA V that, given the encoding of a 
PDCA Ai, computes the encoding codec(co) where cq is the initial configura- 
tion of Ai with input code{Ai). Subsequently, V computes the encoding of the 
successor configuration Ci of Ai and so on. 

The first task of V is to compute the encoding of the initial configuration of 
Ai. The second task is to simulate the universal PDCA lA in order to compute 
encodings of successor configurations of Ai . 
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The construction of U has been done under the assumption that code{M) 
is located at the left of the encoding of the configuration. This situation can 
be emulated as follows. We assume that for each track there exists another one 
which is regarded as the extension of the track. Both tracks are connected at the 
left border. So it suffices to write the mirror image of code{M) on the extension. 
This can be done in \code{M)\ time steps. 

Subsequently, V simulates the presimulation phase of U where codes{S) is 
concatenated in order to initialize the sections. Parallel to this process all cells 
i compute codec{i,0) which completes the first task of V. This computation is 
now explained for one of the stack registers. 

During the presimulation phase codes (S) is moved across the cells. Cell i with 
input Si G C “knows” bin(f). When it receives a ] it waits for |bin(f)| time steps 
and subsequently pushes Os into the stack until it receives a [. Now it pushes 
bin(i) into the stack. 

If codeF{F+) and codepiF-) are also moved across the cells, the encoding of 
Si can be completed simply by successively testing whether Si belongs to one of 
the sets and by pushing the appropriate symbol +, - or b followed by a [. 

Theorem 6. V simulates I transitions of a PDCA A4 that operates on its own 
encoding within t < 13n + 7l\codes{5)\'^ time steps, where n denotes the length 
of the input ofV. 

Proof. From the construction follows: U needs \code{M)\ = n time steps to 
create the mirror image of code{M). Subsequently, V simulates the presimulation 
phase of U in which additionally the encoding of the configuration is computed. 
Since we are concerned with extended tracks the time has to be doubled. Thus, 
this phase needs 12n time steps. After the presimulation phase V simulates 14 
directly and the theorem follows. □ 



6 Tight Hierarchies of Language Families 

Subject of this section are tight time and space hierarchies. Since the number 
of cells is fixed by the length of the input the space complexity is measured as 
stack-space. 

Definition 7. Let M = (S,G,6s,6p,#,-L,A,F+,F-) be a PDCA. F+ U F_ is 
the set of final states. 

1. A word w G A+ is accepted resp. rejected by Ai if at input w the leftmost cell 
of A4 becomes final and if its first final state is an accepting resp. rejecting 
state. 

2. L{M) = {w G A+ I w is accepted by Ai} is the language accepted by Ai. 

3. Let t : N — >■ N, t{n) > n, be a function. A PDCA is said to be t-time-bounded 
or of time complexity t iff every input of length n after at most t{n) time 
steps is accepted or rejected. 
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4- Let g : N — > N &e o function. A PDCA is said to be g-stack-space-bounded 
or of space complexity g iff every input of length n is accepted or rejected 
at some time t and for all t' < t each of the stacks contains at most g(n) 
symbols. 

The family of all languages which can be accepted by PDCAs with time com- 
plexity t resp. space complexity g is denoted by ^t(PDCA) resp. g.Sf(PDCA). 

In order to prove infinite tight hierarchies in almost all cases honest resource 
bounding functions are required. Usually the notion “honest” is concretized in 
terms of the computability or constructibility of the function with respect to the 
device in question. 



Definition 8. A function / : N — >■ N is computable if there exists a PDCA A4 
with input alphabet A and constant stack-space complexity such that A4 recog- 
nizes all w € A+ in exactly /(|u>|) time steps. 



Thus, computability of / means that there exists a PDCA that for any input 
w from A+ can distinguish the time step /(|w|) without using its stacks (i.e. a 
classical cellular automaton). As usual here we remark that the class of such 
functions is very rich |2|'-iyt)|1 dj . 

Now we are prepared to prove the tight time hierarchy. 



Theorem 9. Let t : N — )> N and t' : N — >■ N be two functions. Lf t is computable 
and lim inf„_>oo = 0 then there exists a language L such that 

L e (PDCA) \ .if, (PDCA) 

If additionally Vn G N : t'{n) < t{n) then jSf,/(n,)(PDCA) C .jSf,(n,)(PDCA). 

Proof. Let W be a PDCA that works as follows. At first W checks whether or 
not its input belongs to the language L' = {uv | z; S {0, 1}+ A 3PDCA M : u = 
code{M)} . Since the cell that contains the last symbol of u can identify itself 
this verification needs at most |mz; | time steps. 

Subsequently, W performs two tasks in parallel. One is to simulate the com- 
putation of M with input uv as has been shown by the construction for The- 
orems |51 and The second one is to distinguish the time step t{\uv\) since t is 
computable. 

W rejects its input if uv ^ L' or if after t(\uv\) time steps the simulation 
of M has not produced a decision. If, on the other hand, W recognizes during 
t(|Mu|) time steps that M did its decision, then W rejects if M accepts and vice 
versa. Thus, L(W) G .if,(PDCA). 

Contrarily to the assertion we assume that there exists a PDCA W' with u = 
codeiyV') that recognizes L(W) with time complexity t' . By TheoremElW needs 
ki\uv\-\-k 2 t' {\uv\)k1 time steps in order to simulate t'{\uv\) transitions of W'. ki, 
/c 2 and fca are constants that depend on W'. W.l.o.g. we may assume t'{\uv\) > 
|uu|. Therefore, the time complexity of W is ki\uv\ -\- fc 2 t'(|uu|)fc| < k 4 t'{\uv\) 
for a suitable constant k^. From the limes inferior we obtain a z;' G {0, 1}+ 
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such that k 4 t'{\uv'\) < tdun'l). Therefore, W can simulate t'{\uv'\) transitions 
of W within t{\uv'\) time steps. If W accepts the input uv' in t'{\uv'\) time 
steps then W rejects. If W rejects the input within t'{\uv'\) time steps then W 
accepts. This is a contradiction to the assumption that W' accepts L(W) with 
time complexity t' . □ 

Without proof we state the tight space hierarchy: 

Theorem 10. Let (/ : N — >■ N and g' : N ^ N be two functions. If g is com- 
putable and liminf„_»oo ~ ® then there exists a language L such that 

L G g(„)if(PDCA) \ ,,(„).if(PDCA) 

If additionally Vn G N : g'{n) < gin) then g/(„).j2’(PDCA) C g(„).jSf(PDCA). 
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Abstract. We are given a network of identical processors that work 
synchronously at discrete time steps, a Cellular Automaton. Processors 
are arranged as an array of m rows and n columns with the constraint 
that they can exchange only one bit of information with their neighbours. 
We study the problem to synchronize the cellular automata, that is to 
let all the processors enter the same state for the first time at the very 
same instant. This problem is also known as “The Firing Squad Synchro- 
nization Problem”, introduced by Moore in 1964. We give algorithms to 
synchronize two dimensional Cellular Automata of (n x n) cells at some 
given times: n^, n[logn], n\^Jn^ and 2". Moreover we also give some 
general constructions to synchronize Cellular Automata of (m x n) cells. 
Our approach is a modular description of synchronizing algorithms in 
terms of “fragments” of cellular automata that are called signals. 



1 Introduction 

The Firing Squad Synchronization Problem (shortly FSSP) is the problem of 
synchronizing a Cellular Automaton (CA), a network of identical cells (finite 
automata) that work synchronously at discrete time steps. A general formulation 
of this problem is the following: initially a distinguished cell (the general) starts 
computing while all others are in a quiescent state; at each time step any cell 
sends/receives to/from its neighbours some information about their state at the 
preceding time; the problem is to let all cells in the network enter the same state, 
called firing, for the first time at the very same instant. 

The FSSP was introduced by Moore in 1964 jS] as the problem of synchro- 
nizing a linear CA, where at each step each cell transmits its current state to its 
two adjacent cells. (Its name is due to the fact that the line of cells can be seen as 
a line of soldiers that have to fire simultaneously.) The early results all focused 
on the synchronization in minimal time of a linear CA. A significant amount 
of papers also dealt with some variations of the FSSP that concerned both the 
geometry of the network and computational constraints. Besides minimal-time 
solutions to the FSSP, also solutions at a predetermined (non minimal) time 
have been considered m- This is an interesting and challenging theoretical 

M. Margenstern and Y. Rogozhin (Eds.): MCU 2001, LNCS 2055, pp. 264-|22^ 2001. 
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problem, which is also directly connected to the sequential composition of cel- 
lular automata. Given two cellular automata Ai and A2 computing respectively 
the functions fi{x) and f2{x), the sequential composition of Ai followed by A2 
is the cellular automaton obtained in the following way: first Ai starts on a 
standard initial configuration and when it has done with its computation, A2 
starts using the final configuration of Ai as initial configuration. The resulting 
automaton clearly computes f2ifi{x)). In order to compose the two automata, 
it is necessary to synchronize all the cells that will be used by A2 at the time 
Ai computes fi{x) for a given input x. Solutions to the FSSP at a given time 
have been studied on rings and toroidal square arrays with unidirectional flows 
of information [ 3 | , and on lines of cells exchanging only one bit at each time step 

. 

In this paper we study non minimal time solutions to the FSSP. We consider 
a one bit 2 -dimensional cellular automaton, that is a grid of (m x n) identi- 
cal finite-state processors (cells) which exchange one bit of information each 
other. A synchronization in time t(m, n) of an array of (m x n) cells is a one 
bit 2 -dimensional cellular automaton such that, starting from a standard con- 
figuration, all cells enter for the first time the Firing state at time t(m,n). We 
obtain new synchronization times in a compositional way: we first describe basic 
synchronizing algorithms and then we give general rules to compose synchroniza- 
tions. The basic synchronizations in turn are obtained by composing elementary 
signals, which can be seen as fragments of Cellular Automata. A synchroniza- 
tion is thus a special signal obtained as a composition of many simpler signals. 
Compositional rules for both signals and synchronizations include parallel com- 
position, sequential composition, and iterated composition. We also state some 
sufficient conditions to apply these compositions. In the parallel composition we 
start many synchronizations or signals, all at the same time. Sequential composi- 
tion appends a synchronization or a signal to the end of another signal, possibly 
with a constant time offset. This way we are able to construct a synchroniza- 
tion in time ti(m,n) t2(m,n) d, for d > 0 , if there exist synchronizations 
in time ti(m,n) and t2(m,n). The iterated composition consists of iterating 
t2(m,n) times a synchronization in time t\(m,n) given a synchronization in 
time t2(m,n), thus obtaining a new synchronization in time ti(m,n) ■ t2(m,n). 

We also give a construction to “inherit” synchronizations on (m x n) arrays of 
processors from synchronizations of lines of k processors. We show that an (mxn) 
array of processors can be seen as many lines of (m -I- n — 1 ) processors (each of 
them having as endpoints cells ( 1 , 1 ) and (m, n)) where a synchronization can be 
executed simultaneously on all these lines. Thus we can synchronize an (m x n) 
array in time t(m n — 1 ), provided that there exists an algorithm for a linear 
array of k processors in time t(k). Finally, we give algorithms to synchronize an 
(n X n) square array in time nflogn], n\y/n) and 2 ", and we compositions 
of synchronizations to determine synchronizations in any “feasible” linear time 
and in any time expressed by a polynomial with nonnegative coefficients. These 
constructions use as building block synchronizations in time and in minimal 
time. 
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2 Preliminaries 

In this section we give the definitions needed in the rest of the paper. 

Basic Definitions. A one bit 1-dimensional cellular automaton (shortly 1-CA) 
is a line of n finite-state machines, called cells, which are identical except for 
those at the two endpoints. In a 1-CA, the i-th cell is connected to the {i — 1)- 
th and {i + l)-th cells, for all * = 2, . . . , n — 1. The first and the last cells are 
connected, respectively, to the second and the (n— l)-th cell. The 2-dimensional 
case is a natural generalization of the 1-CA. Omitting minor details, a one bit 2- 
dimensional cellular automaton (shortly 2-CA) is an array of (m x n) finite-state 
machines (cells) which are identical except for the boundary ones, and where cell 
(i,j) is connected to cells {i — 1, j), {i + 1, j), {i,j — 1) and (i,j + 1). The cells 
operate synchronously at discrete time steps. At each step each cell exchanges 
one bit of information with its adjacent cells and modifies its state depending 
on its current state and the bits sent by the adjacent cells at the previous step. 

In what follows, the symbol Q refers to the set of states of a given cellular 
automaton. A configuration of a 1-CA is a mapping C : {1, 2, . . . , n} — >■ {0, 1} x 
Qx {0, 1}. A configuration at time t gives, for each cell i, the state entered and the 
two bits sent at this time. A starting configuration is a configuration at time 1. 
The definition of configuration can be easily extended to a 2-CA, by considering 
that each cell sends the bits to its four adjacent cells. In the following we often 
write “{A, C)” to denote a 1-CA, or a 2-CA, A starting on a configuration C. 
Within the state set there are three distinguished state: G the General state, 
L the Latent state, and F the Firing state. State L has the property that if a 
cell in state L receives all bits 0 from its neighbours, it remains in the state L 
and sends bits 0 to its neighbours. A standard configuration is a configuration 
where each cell is in state L and sends bits 0, except for cell 1 (resp. cell (1, 1) 
) which is in state G and sends bits 1 to each neighbour. A synchronization in 
time t{n) of a linear array of n cells, is a 1-CA such that starting from a standard 
configuration all cells enter at time t[n) for the first time state F. Analogously a 
synchronization in time t(m, n) of a rectangular array of (m x n) cells is a 2-CA 
such that starting from a standard configuration all cells enter for the first time 
state F at time t{m,n). When m = n, we speak about a synchronization of a 
square array in time t(n). A synchronization of a linear array of n cells in time 
(2n — 1) is called a minimal time synchronization, since it can be easily proved 
that a synchronization is not possible in time less than (2n — 1). 

Signals. In ^ the concept of signal was introduced as a mean to design a 1-CA. 
Informally a signal describes the information flow in the space-time description 
of a cellular automaton, allowing a modular description of the synchronization 
process. The scheme used to present some synchronization algorithms in time 
t > 2n — 1 for a a linear array of n processors is the following: some signals are 
designed and composed in order to obtain an overall signal that starts from the 
leftmost processor and comes back to it in exactly (t — 2n + 1) time units; then 
a minimal time synchronization starts, thus synchronizing the n processors in 
time t. We consider the time unrolling of a 1-CA A starting on a configuration 
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C, that is we reason about a space-time array. A pair (i,t) of this array, with 
I < i < n and t > 1, is called a site, the state of the cell i at time t is denoted 
state{i,t) and the bits sent to the adjacent cells are denoted by left{i,t) and 
right{i, t). A site (i, t) is said to be active if either sends/receives a bit 1 to/from 
its neighbours or changes its state. We denote by Cell(A, C) the set of cells i 
such that site {i,t) is active for some t. 

Let A be a 1-CA and Che a, configuration. Define the time = max{t| (i , t) 
is active} and t™™ = min{t|(i,t) is active}. The set of sites for i G 

Cell(A, C) is called rear of (A, C) and the set of sites {i,t™^^) is the front of 
(A, C). Moreover we say that (A, C) is tailed if there exists a subset of Q, called 
tail(A,C) such that for all i G n}, state{i,t) G tail(A, C) if and only if 

{i, t) belongs to the front of (A, C). The states in tail(A, C) are called tail states. 
In words, a tail state appears for the first time on the front of (A, C). 

Two active sites (* 2 ,^ 2 ) are consecutive if t 2 = fi + 1 and *2 G {ii — 

1, ii,i\ + 1}. A simple signal of (A, C) is a subset S of consecutive sites with the 
property that if (A, C) is tailed, then {i, belongs to S. The union of a finite 
number of simple signals of a given (A, C) is called signal of (A, C). A graphical 
representation of a simple signal S is obtained by drawing a line between: 

(i) every pair of sites {i, t) G S and {i,t + l) G S and 

(ii) every pair of sites {i, t) G S and (i -I- 1, t -I- 1) G ^ (resp. (i — 1, t -I- 1) G S) if 
right{i,t) = 1 (resp. left{i,t) = 1). 

A graphical representation of a signal is obtained by the graphical representation 
of its simple signals. The length of a signal S is (t™®-’' — -|- 1) where = 

max{t|(z,t) G S', 1 < i < n} and t™'" = min{t|(i,t) G S, 1 < z < n}. Sometimes, 
in the rest of paper we refer to a signal without specifying a 1-CA and a starting 
configuration. 

In the following example we recall a signal introduced in 0. 

Example 1. Given a positive constant k < n, the signal MARK(n — k) is used to 
mark the cell n — k. The length of the signal Mark is n -|- k{see Figure ^ 1 . It 
can be easily seen that Mark is a signal of a tailed 1-CA. 

We recall now the signal composition. We say that a 1-CA A 2 on C 2 can follow 
a tailed 1-CA Ai on Ci if there exists a function h defined over tail(Ai, Ci) and 
such that h{p) = C 2 {i) if p = state{i,t). Given two signals and S 2 , we can 
define the concatenation cat^(S'i, S' 2 ) as the signal obtained by starting S\ at 
time 1 and S 2 at time r -|- 1, that is S 2 is delayed r time steps. More formally 
catr(S'i, 52) = S'! U {(z, t + r)|(z, t) G S' 2 }. 

The following remark recalls some sufficient conditions for the existence of a 
tailed 1-CA for a signal catr(S'i, 52 ). 

Remark 1. g] Let S'!, S '2 be signals of tailed 1-CA’s (Ai,Ci) and (A 2 ,C 2 ), re- 
spectively. The signal S = catr(S'i, S' 2 ) is the signal of a tailed 1-CA (A, C) if 
the following two conditions hold: 

1. (A 2 ,C 2 ) can follow (Ai,Ci); 

2. if the site (z, t) belongs to the front of (Ai, Ci) and (z, t') belongs to the rear 

of (A 2 , C 2 ), then t <t' + r. 
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cells 



QUAD(n-2) 

MARK(n-2) 



time 



Fig. 1. The signal cati (QuAD(n — 2), MARK(n — 2)). 



3 Synchronization of a Sqnare Array 

In this section we introduce two signals of a 1-CA. The first has a quadratic 
length and the second has an exponential length in the number of cells. Then 
we design synchronizations of a square array using the following scheme: first 
we synchronize the first row of the square array and then we simultaneously 
synchronize all the columns. That is the rows and the columns are seen as 1-CA. 
This way we obtain synchronizations of a square array of (n x n) cells in n^, 2", 
n [log n] and n [ ^/n ] . 

The signal Quad. Given a positive constant k < n, QuAD(n — fc) is a signal of 
a 1-CA A which is described as follows: 

— initially the cell 1 sends a bit 1 to the right; then if it receives a bit 1 from 
the right, it sends with a delay of one step (except for the first time, when 
there is no waiting), a bit 1 back to the right; the cell 1 eventually halts 
when it receives two consecutive bits 1; 

— for 1 < h < {n — k), the cell h sends a bit 1 to the left when it receives for 
the first time a bit 1 from the left; then, if the cell h receives again a bit 1 
from an adjacent cell, it sends a bit 1 to the other adjacent cell; 
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— the cell {n — k) sends two consecutive bits 1 to the left when it receives a bit 
1 from the left. 

The 1-CA A can be further designed such that it is tailed by observing that 
the cells from 1 to (n—k) can enter a tail state when they receive two consecutive 
bits 1. The length of the Quad signal is (n — k)^ — 1. 

Clearly, for the implementation of this signal cell (n — k) needs to be dis- 
tinguished. In what follows we will use QuAD(n — 2), thus we only need to 
distinguish cell (n — 2): this can be done by MARK(n — 2) and for all n > 5. 
Clearly for smaller n much easier ad hoc algorithms can be given, (see Figure 

The signal EXP. Given two positive constants k and c, we will define the signal 
Exp(n — k,c). 

An idle cell is a cell which never sends a bit 1 unless it receives a bit 1 from 
the left and in this case it sends two consecutive bits 1 to the left. 

Initially the only idle cell is the cell (n — k). Exp(n — k,c) is a signal of a 
1-CA which is described as follows: 

— first cell 1 sends a bit 1 to the right; then, whenever cell 1 receives a bit 1 
from the right, it immediately replies sending back a bit 1; finally, if cell 1 
receives two consecutive bits 1 from the right, then it changes into an idle 
cell; 

— for 1 < /i < (n — k), we distinguish two cases: 

• if the bit is received from the left then it alternates the following two 
behaviours: 

1. it sends a bit 1 back to the left, (let us call these peak cellsj^ 

2. it sends a bit 1 to the right; 

each peak cell starts counting from 1 to 2®+^ — 2, for 1 < i < c. When 
21+1 _ 2 ]^gg been just counted, if the peak cell receives a bit 1 from the 
left at the next time unit, then it is the Ath cell in the line and is marked 
(see below for an explanation). This way it can be distinguished later. 

• if a bit 1 is received from the right, then it sends a bit 1 to the left. If at 
the next time unit cell h receives another bit 1 from its right neighbour, 
then two other subcases need to be considered: 

if > c then the cell switches into an idle cell; 
else, for h < c, the cell sends two consecutive bits 1 to the left. (Note 
that when this case occurs, cells h < c have already been marked by 
step 2 above.) 

From the algorithm we have just described, a proof by induction on i < c 
can be given to show how a peak cell can be marked, in fact the following 
property holds: the length of the interval from the instant cell i is a peak cell 
for the first time and the instant it becomes a peak cell for the second time is 
2* + Ej=i 2-’ {i - j), (see Figure 0 where c = 3, cell 2 is marked at time 9 and 
cell 3 is marked at time 20.) 

^ Actually, this is a property of the state entered by this cell. 
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Fig. 2. The signals cati(Exp(5, 3), Mark( 5)) and cati(Exp(5, 1), Mark(5)) 



To implement a tailed 1-CA for Exp(n— fc, c) initially the cell (n—k) must be 
distinguished. In what follows we will use the signals Exp(n — 2,3) and Exp(n — 
2, 1): the cell n—2 can be distinguished by using MARK(n— 2), for n > 5. Observe 
also that the cells from 1 to (n — 2) can enter a tail state after they received two 
consecutive bits 1. The length of Exp(n— fc, c) is 2’^“*^+^ — 2(n— /c)— 2°+^+2(c+l) 
(see Figure I 2 I). 

We can now give the synchronizing algorithms for the square array. 

Theorem 1. There is a synchronization of an (n x n) square array in time . 

Proof. The algorithm is the following: first a signal cati(MARK(u— 2), QuAD(n— 
2)) is started on the first row, the length of this signal is (n — 2)^ since QuAD(n — 
2) is delayed one time step. This is a signal of a tailed 1-CA starting from a 
standard configuration (see Remark [Q. Thus after {n — 2)^ time units the cell 
(1, 1) enters a tail state, say G' . Considering G' as the General state, a minimal 
time synchronization on a linear array of n cells is executed on the first row and 
this takes other (2n — 2) time units. Once the Firing state F' is reached, we use 
F' as the General state of a minimal time synchronization that this time runs 
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on each column, thus taking another (2n — 2) time units, which adds up to a 
total time of n? . 

Theorem 2. There is a synchronization of an (n x n) square array in time 2". 

Proof. First a signal cati(Exp(n — 2,3),MARK(n — 2)) is started on the first 
row, see Figure 0 After (2"“^ — 2n — 3) time units the cell (1, 1) enters a tail 
state, say H. This is a signal of a tailed 1-CA starting from a standard configu- 
ration (see Remark P). Now the cell (1, 1) enters a state G' and a minimal time 
synchronization on the first row is accomplished, using G' as the General state, 
thus taking other (2n — 1) time units. Once the Firing state F' is reached, each 
cell of the first row enters a state G", and launches the signals MARK(n — 2) 
and Exp(n — 2, 1) on each column, using G" as the General state. This takes 
another (2"“^ — 2n + 5) time units, which sums up to time (2" — 2n-|- 1). Finally, 
a minimal time synchronization on each column is accomplished, thus reaching 
time 2". 

We can construct a synchronization in time nflogn] and in time n\-\/n~\ 
using signals of exponential and of quadratic length, by exploiting the inverse 
functions. (The algorithms resemble those used to synchronize a line of n cells 
at the same times shown in 0.) 

Theorem 3. There is a synchronization of a (n x n) square array in time 
n [log n~\ and in time n [ y/n ] . 

4 How to Obtain New Synchronizations 

In this section we discuss how to obtain new synchronizations of (m x n) arrays 
using known algorithms to synchronize linear arrays. We start describing syn- 
chronizations of an (m x n) array in time t(m + n — 1), given a synchronization 
of a line of k processors in time t{k). Then, we give some compositional rules 
on synchronizations of fm x n) arrays. We conclude this section showing how to 
construct synchronizations of a square array of processors in any arbitrary “fea- 
sible” linear time and in any time expressed by polynomials with nonnegative 
integer coefficients. 

Theorem 4. Given a synchronization of a line of k processors in time t{k), 
there exists a synchronization of an (m x n) array in time t(m + n — 1). 

Proof. An (m x n) array can be seen as many lines of (m -|- n — 1) cells, each of 
them having as endpoints cells (1, 1) and (m, n). Each of these lines corresponds 
to a “path” from cell (1, 1) to cell (m, n) going through (m -|- n — 3) other cells. 
Each cell (i,j) of these paths has as left neighbour either cell {i — l,j) or cell 
{i,j — 1) and as right neighbour either cell {i + 1, j) or cell {i,j + 1). 

Notice that the cell (i,j) is the {i + j — l)-th cell from the left in all the lines 
it belongs to. This property allows us to execute simultaneously on all these lines 
a synchronization in time t{k) for a line of k cells. Since the length of each line is 
{m + n— 1), we have a synchronization of the (m x n) array in time t(rn + n — l). 
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In 0 synchronizations for a linear array of n cells have been given in the 
following times: n?, 2", n\\ogn\, and n\^/ri\. Using these results and the above 
theorem we can give the following corollary. 

Corollary 1. Given an m x n array and K = m + n — 1, then 

— There are synchronizations of the {m x n) array in time , 2^ , K [logiL], 
and K\y/K~\ . 

— Let a and b be two integer numbers, if aK + b > 2K — 1 then there is a 
synchronization of the (to x n) array in time {aK + b). 

— Let h > 2 be an integer number and uq, ... ,ah natural numbers with > 1, 
then there is a synchronization of the (to x n) array in time a^K^ + . . . + 
aiK^ + oq. 

Given a 2-CA, we can consider its time unrolling and easily extend the 
definitions given in section |21 to signals for 2-CA. For a 2-CA (A, C), we call 
Links(A, C) the set of communication links effectively used by {A, C), that is all 
the ordered pairs of adjacent cells x, y such that there is a bit 1 sent from x to y 
at some time t. We give now some results on signal composition. The first lemma 
says that when two signals have disjoint sets of active communication links then 
it is possible to obtain a new signal which is their parallel composition. The 
second lemma generalizes Remark 1 to the 2 dimensional case and establishes 
when it is possible to design a 2-CA to concatenate two signals, thus obtaining 
their sequential composition. 

Lemma 1. Given an (to x n) 2-GA A, let Si and S 2 be two signals of A on 
configurations Ci and C 2 , respectively. Lf Links{A,Ci) fl Links{A,C 2 ) = 0 then 
there exist an (to x n) 2-GA A' and a configuration C' such that Si U 5'2 is a 
signal of {A' ,C'). Moreover, if{A,Ci) and (A,C 2 ) are tailed then also {A' ,C') 
is tailed. 

Lemma 2. Let <S'i,S '2 be signals of two 2-GA’s (Ai,Ci) and {A 2 ,C 2 ), respec- 
tively. The signal catr{Si, S 2 ) is the signal of a 2-GA (A, C) if the following two 
conditions hold: 

1. (Ai,Ci) is tailed and (A 2 ,C 2 ) can follow (Ai,^!); 

2. if the site {i,j,t) belongs to the front of {Ai,Ci) and {i,j,t') belongs to the 
rear of (A 2 , C 2 ), then t < t' -\- r. 

Moreover if {A 2 ,C 2 ) is tailed and Gell{Ai,Ci) C CeZZ(A 2 , C 2 ), then (A, C) 
is tailed too. 

We can now give the sequential and iterated compositions of synchronizations 
of (to X n) arrays. In the following, if A^ is a synchronization, then Gi, Li, and 
Fi are the General, Latent, and Firing states of Aj, respectively. 

Theorem 5. If Ai for i = 1,2 are two synchronizations of an (to x n) array 
in time ti{m,n) and d>Q, then there is a synchronization in time ti{m,n) + 
t 2 {m, n) + d. 
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Proof. Let Si be the signal of {Ai^Co), where Co is a standard configuration. 
From Lemma|21 if r = ti{m, n) + d, then there exists A such that catr(5i, S 2 ) is 
a signal of {A, Co). Moreover, catr(S'i, 5 * 2 ) is a synchronization in time t{m, n) = 
ti{m, n) + t 2 {rn, n) + d. 

Theorem 6. If Ai for i = 1, 2 are two synchronizations of an (m x n) array in 
time ti{m,n), then there is a synchronization in time ti(rn^n)t 2 (jn,n). 

Proof. We define a synchronization A consisting of an Iterative phase with 
length ti{m,n) which is executed t 2 (m,n) times. The set of states of A is 
Qi X Q 2 X {0,1}"^, the General state is (Ci, G 2 , 0, 1, 0, 0), the Latent state is 
{Li,L 2 , 0, 0, 0, 0) and the Firing state is (Fi, C 2 , 0, 0, 0, 0). In the Iterative phase, 
the synchronization A modifies the first component of its state according to the 
transition functions of Ai, until this component is Fi. At the end of this phase A 
executes a transition step modifying the second component of the state according 
to the transition functions of A 2 . Outputs of A 2 transition functions are saved 
in the last four components according to the order left, right, up, and down. 
Moreover, in this same step, A replaces Fi with either G± or Li (depending on 
whether the cell is the one triggering in the initial configuration the firing signal 
of A\) in the first component. So the Iterative phase can start again, until the 
Firing state is entered by all the cells. So, the synchronization A\ is iterated 
exactly t 2 {m,n) times and A takes time ti(rn,n)t 2 {rn,n). 

Let A be a synchronization in time t(m,n) and X x Y G m} x 

{1, . . . , n}, we say that A is {X x Y)-detectable if the states set state{i,j, t{m, n) — 
1), y{i,j) & X xY, is disjoint from the set of states j', t(m, n) — 1), for 

all (i',j') ^ X xY. Furthermore, we say that A has the parity property with 
respect to m (respectively, n), if for i = 1, . . . , m and j = 1 , ... ,n: 

— the set of states containing state(l, j, t(m, n) — 1), if m is even, (respec- 
tively, state{i, 1, t{m, n) — 1), if n is even) is disjoint from the set containing 
state(l,j,t(m,n) — 1) (respectively, state(z, 1, t(m, n) — 1)), otherwise; 

— the set of states containing state{m, j,t{'m,,n) — 1), if m is even, (respec- 
tively, state(i,n,t{'tn,n) — 1), if n is even) is disjoint from the set containing 
state{m,j,t{'nT'T'n) — 1) (respectively, state{i,n,t{'m,n) — 1)), otherwise. 

We recall that if we consider a line of n cells with an initial configuration 
state(l, 1) = state{n^ 1) = G and state{i, 1) = L for z 1, n, we can synchronize 
the line in time n, if n is odd, and time rz — 1 otherwise. We call such a synchro- 
nization a two-end synchronization. Mainly, in this synchronization the linear 
array is split in two halves and, starting at the same time, on both the halves a 
minimal time synchronization is executed (the composition of the corresponding 
signals is possible by Lemma [Q. 

Lemma 3. Let d> 0 and m > d (respectively, n> d). Let A he a synchroniza- 
tion of an (m x n) array in time t(rn, n) with the parity property with respect to 
m (respectively, n) and {X x Y)-detectahle for a set X xY = {X' x {1, . . . ,rz}) 
(respectively, X xY = ({1, . . . ,m} x Y')), where: 
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— X' = {1, . . . , d} U {m — d + 1, . . . , m} U {to/2, to/2 + 1} (respectively, Y' = 
{1, . . . , d}U (n — d+ 1, . . . , n}U (n/2, n/2 + 1}/, z/to (respectively, n) is even 
and 

— X' = {1, , d}U{TO— d+1, . . . , to}U{ |"to/2] } (respectively, Y' = (1, . . . , d}U 
{n — d + 1, . . . , n} U { \n/2\ } ), otherwise. 

Then there exists a synchronization of an (to x n) array in time t(rn, n) +m — d 
(respectively, t{m, n) + n — d) . 

Proof. We consider only the case that X is a synchronization of an (to x n) 
array in time t(m, n) with the parity property with respect to to, the other case 
is analogous. For d = 0, a synchronization in time t(m, n) + m consists of two 
phases. The initial phase is the synchronization A itself and the second phase 
is a two-end synchronization on (1, z), . . . , (to, i) for * = 1, . . . , n. By hypothesis 
at the end of the first phase the middle cells (or the shared middle cell, when 
TO is odd) of each vertical line are marked, so they can behave as the first and 
the last in the line. Moreover, the first and the last cells are aware of the parity 
of TO, so that they can be set in suitable General states, in such a way that 
the total time of the two-end synchronization is to in both cases. Thus from 
LemmaEI a synchronization in time t{m, n) + m exists. A synchronization A' in 
time t(m, n) + m — d can be obtained by modifying the previous synchronization 
in such a way that Ad jumps from the (t(rn, n) — l)-th configuration of A exactly 
to the d-th configuration of the two-end synchronization. Notice that in the d-th 
configuration of the two-end synchronization only the first d and the last d cells 
of each line are in states which are different from the Latent state. Thus, by the 
{X X y)-detectability of A, Ad is properly defined and is a synchronization in 
time t(m, n) + m — d. 

Lemma 4. Let A be a synchronization of an (to x n) array in time t{m, n) with 
the parity property with respect to m (respectively, n) and {X x Y)-detectahle 
for a set X X Y = {X' x {!,..., n}) (respectively, X xY = ({1, . . . , to} x Y') ) 
where: 

— X' = {m/2, m/2 + 1} (respectively, Y' = {n/2,n/2 + 1\) if m (respectively, 
n) is even, and 

— X' = II’to/2]} (respectively, Y' = ||’n/2]}/, otherwise. 

Then there is a synchronization of an (to x n) array in time mt(m,n) (respec- 
tively, nt{m, n) ). 

Proof. We consider only the case that A is a synchronization of an (to x n) ar- 
ray in time t(m, n) with the parity property with respect to m, the other case is 
analogous. We define a synchronization A' consisting of an iterative phase, with 
length t{m,n), executed to times. The iterative phase consists of the synchro- 
nization A. The states of Ad are tuples whose first component is a state of A 
and the second component is a state of the two-end synchronization applied to 
each row of the array. In the iterative phase. Ad modifies the first component of 
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a state according to the transition functions of A. At each iteration, just a step 
of a two-end synchronization is performed. Thus A' is a synchronization in time 
m t{m, n). 

In the rest of the section we present the polynomial time synchronizations 
of a square array of (n x n) processors. By the minimal time synchronization 
presented in jS|, we obtain the following result. 

Remark 2. There exists a minimal time synchronization of an (n x n) square 
array in time t{n) = 3n — 2 with the parity property and {X x K)-detectable for 
a set A X y = (X' x n}), where X' = {n/2, n/2 + 1}, if n is even, and 

X' = {[n/2]}, otherwise. 

Thus we have the following theorems. 

Theorem 7. Let a and b be natural numbers. There is a synchronization of an 
(n X n) square array in time 3n — 2 -|- a(n — 2) -|- 5. 

Proof. Directly by Remark |21 Lemma El (for d = 2), and Theorem 0 

Theorem [Oshows the existence of a synchronization in time n^, which includes 
a minimal time synchronization, thus the next remark follows. 

Remark 3. There exists a synchronization of an (n x n) square in time n^ with 
the parity property and {X x F)-detectable for a set A x T = (A' x (1, . . . , n|), 
where: 

— A' = |n/2, n/2 -|- 1}, if n is even, and 

— A' = {[n/2]}, otherwise. 

Finally we present synchronizations for any feasible polynomial time. 

Theorem 8. Let h > 2 be an integer number and oq, . . . ,ah natural numbers 
with a?i > 1 . There is a synchronization of an (n x n) array in time atn^ -I- . . . -I- 
oin^ -I- oq. 

Proof. From Remark Eland Lemma 0 a synchronization in time n^ can be ob- 
tained for every b > 2. Using Theorem 0 to compose these times, the theorem 
follows. 
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Abstract. P systems, introduced by Gh. Paun form a new class of dis- 
tributed computing model. Several variants of P systems were already 
shown to be computationally universal. In this paper, we propose a new 
variant of P systems, P systems with membrane creation, in which some 
objects are productive and create membranes. This new variant of P sys- 
tems is capable of solving the Hamiltonian Path Problem in linear time. 
We show that P systems with membrane creation are computationally 
complete. 



1 Introduction 

P systems are a class of distributed parallel computing devices of a biochemical 
type, introduced in which can be seen as a general computing architecture 
where various types of objects can be processed by various operations. In the 
basic model one considers a membrane structure with a main membrane, called 
the skin membrane, consisting of several cell-like membranes. If a membrane does 
not contain any other membrane, it is called an elementary membrane. We for- 
malize a membrane structure by means of well-formed parenthesized expressions, 
strings of correctly matching parentheses, placed in a unique pair of matching 
parentheses. Each pair of matching parentheses corresponds to a membrane. 
Graphically a membrane structure is represented by a Venn diagram without in- 
tersection and with a unique superset. The membranes delimit regions, where we 
place objects, elements of a finite set (an alphabet). The objects evolve according 
to given evolution rules, which are associated with regions. An object can evolve 
independently of the other objects in the same region of the membrane structure, 
or in cooperation with other objects. In particular, we can consider catalysts, ob- 
jects which evolve only together with other objects, but are not modified by the 
evolution (they just help other objects to evolve). The evolution rules are given 
in the form of multiset transition rules, with an optional associated priority rela- 
tion. The right hand side of the rules are of the form (a, here), (a, out), (a, iUj), 
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where a is an object. The meaning is that one occurrence of the symbol a is pro- 
duced and remains in the same region, is sent out of the respective membrane, 
or is sent to membrane j (which should be reachable from the region where the 
rule is applied), respectively. 

The application of evolution rules is done in parallel. Starting from an initial 
configuration (identified by the membrane structure, the objects - with multi- 
plicities - and rules placed in its regions) and using the evolution rules, we get 
a computation. We consider a computation complete when it halts, no further 
rule can be applied. Two ways of assigning a result to a computation were con- 
sidered: (a) by designating an internal membrane as the output membrane, (b) 
by reading the result outside the system. We deal here with the latter variant, 
where the output is obtained in the natural way: we arrange the symbols leaving 
the system, in the order they are expelled from the skin membrane; when several 
objects exit at the same time, any permutation of them is accepted. 

In this paper, we propose a new variant of P systems, P systems with membrane 
creation, in which some objects are productive and create membranes. This new 
variant of P systems is capable of solving the Hamiltonian Path Problem in linear 
time. We also show that P systems with membrane creation are computationally 
complete. 

2 Passive and Active P Systems 

P systems defined in H3 have the property that their membrane structure con- 
sists of m different membranes (w.r.t the labels of membranes) and during the 
computation this number may decrease (by dissolving membranes) but not in- 
crease. Similarly in one of the variants of P systems, i.e., P systems with active 
membranes defined in m and IS], the membrane structure consists of m dif- 
ferent membranes and during the computation this number may either decrease 
(by dissolving membranes) or increase (by dividing the existing membrane) but 
this number (w.r.t the labels of membranes) is always less than or equal to m. 
Such systems are called passive P systems. The degree of a passive P system is 
the number of initial membranes present in the system. 

Sometime it may happen that different type of membranes may be increased in 
a membrane system by creating new membranes whose label is different from 
the existing ones. Such systems are called active P systems. Since the different 
type of membranes may increase during the computation, we can not take initial 
membranes as the degree of the membrane system. So the degree of an active P 
system is defined as the ordered pair consisting of the number of initial mem- 
branes as the first component and the number of different type of membranes 
used in the whole computation as the second component. Here the second com- 
ponent is always greater than or equal to the first component. Since our variant, 
P systems with membrane creation belongs to the active P systems category, we 
follow latter notation for describing the degree of the membrane system. 
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3 Language Prerequisites 

A matrix grammar with appearance checking is a construct G = (iV, T, S', M, F)^ 
where N, T are disjoint alphabets, S G N, M is a finite set of sequences of the 
form {Ax — >• — >• a;„),n > 1, of context-free rules over N \JT (with 

Ai G N,Xi G {N U T)*, in all cases), and A is a set of occurrences of rules in 
M {N is the nonterminal alphabet, T is the terminal alphabet, S is the axiom, 
while the elements of M are called matrices.) 

For w,z G (iVUT)* we write w ^ z A there is a matrix {Ax — >• xi, • • • , A„ — >■ x„) 
in M and the strings Wi G {N \J T)* , 1 <i <n+l^ such that w = wx^z = ic„+i, 
and, for all 1 < j < n, either (l)wi = w[Aiw'l ,Wi+x = w[xiw” , for some w{,w” G 
{N U r)*, or {2)wi = Wi+x,Ai does not appear in Wi, and some rule Ai — >■ xi 
appears in F . (The rules of a matrix are applied in order, possibly skipping the 
rules in F if they cannot be applied, so we say that these rules are applied in 
the appearance checking mode.) 

The language generated by G is defined by L{G) = {w G T*\S =>* w}. The 
family of languages of this form is denoted by MATac- When F = (f> (hence we 
do not use the appearance checking feature) , the generated family is denoted by 
MAT. 

It is known that CFG MAT C MATac = RE, the inclusion being proper. 

A matrix grammar G = {N, T, S, M, F) is said to be in the binary normal form 
if TV = A^iUA^ 2 U{iS', f}, with these three sets mutually disjoint, and the matrices 
in M are in one of the following forms: 

1. {S ^ XA),with X G Nx,Ag N 2 ] 

2. {X ^Y,A^ x),with X,Y G Nx,Ag N 2 ,x G {N 2 LIT)*; 

3. {X ^Y,A^^),withX,Y gNx,AgN2-, 

4. {X ^ X, A ^ x), with X G Nx, A G N 2 , X G T*-, 

Moreover, there is only one matrix of type 1 and F consists exactly all rules 
A — f appearing in matrices of type 3; f is called a trap symbol, because once 
introduced, it is never removed. A matrix of type 4 is used only once, in the last 
step of the derivation. 

According to 0, for each matrix grammar there is an equivalent matrix gram- 
mar in the binary normal form. 

For an arbitrary matrix grammar G = {N,T, S, M, F), let us denote by ac(G), 
the cardinality of the set {A G N\A a G F}. From the construction in the 
proof of Lemma 1.3.7 in j0| one can see that if we start from a matrix grammar 
G and we get the grammar G' in the binary normal form, then ac(G') = ac(G). 
Improving the result from H2| ( six nonterminals, all of them used in the appear- 
ance checking mode, suffice in order to characterize RE with matrix grammars), 
in [Zj it was proved that four nonterminals are sufficient in order to characterize 
RE with matrix grammars and out of them only three are used in appearance 
checking rules. Of interest here is another result from [71; if the total number of 
nonterminals is not restricted, then each recursively enumerable language can 
be generated by a matrix grammar G such that ac(G) < 2. 

Consequently, to the properties of a grammar G in the binary normal form we 
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can add the fact that ac{G) < 2. This is called as strong binary normal form for 
matrix grammars. 

A multiset over an alphabet V = {oi, • • • , On} is a mapping ^ from V to N, the 
set of natural numbers, and it can be represented by any string w € V* such 
that ^v{w) — (/t(oi), • • • , /i(a„)), where is the Parikh mapping associated 
with V . Operations with multisets are defined in the natural manner. 

4 P Systems with Membrane Creation 

One of the primary goals of all living creatures is to survive. To do so, cells must 
be able to reproduce. Cells [Q can reproduce in two ways, mitosis and meiosis. 
Meiosis is a form of sexual reproduction and only occurs in gametes (reproduc- 
tive cells). Mitosis0 is the process in which an Eukaryotic cell divides into two 
equal, genetically identical parts. There are 5 stages in mitosis, those are: 1. 
Interphase, 2. Prophase, 3. Metaphase, 4. Anaphase, 5. Telophase. In Prophase, 
the nuclear membrane will dissolve and a new nuclear membrane will reform in 
Telophase. So in nature, it happens that membranes emerge in certain circum- 
stances just by the organization of certain chemical compounds M- Similarly, 
each membrane (in biological sense) is having its own properties and the mem- 
branes have recognition proteins 0 with which it can recognize other membranes. 
With the help of indexing the membranes and using the target iui we model the 
behavior of a membrane. We try to model the behavior of a membrane creation 
with our new variant, i.e., P-sy stems with membrane creation. Each membrane 
in the membrane system is having both productive and non-productive objects. 
A productive object is an object which can create a new membrane and trans- 
forms into other object. A non-productive object only transforms into another 
object without creating new membrane. 

Formally a P system with membrane creation of degree n > m > 1, is a 

construct 



n = , W(m_i), i?o, i?i, • • • , i?(„_i)). 



where: 

1. U is an alphabet; it consists of both productive and non-productive objects; 

2. T C U, is the output alphabet; 

3. C (IV ^ (j), is the set of catalysts; 

4. /r is a membrane structure consisting of m membranes, with the membranes 
and the regions labeled in a one-to-one manner with elements in a given set; 
here we always use the labels 0 (for the skin membrane), 1,- • • , (m — 1); 

5. Wi, 0 < i < {m — 1), are multisets of objects over V associated with the 
regions 0, 1,- • • , (m — 1) of /r; 

6. Ri, 0<i<(n— 1), are finite set of evolution rules over V. An evolution 
rule is of two types: 

(a) If a is a single non-productive object, then the evolution rule is in the 
form a —>■ r: or ca —>■ cv, where c G C, a G (U — C), v = v' or v = v'S or 
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V = v't, where v' is a multiset of objects over {{V — C) x {here, out}) U 
{{V — C) X {irij\l < j < (n— 1)}), and 6,t are special symbols not in V. 
(b) If o is a single productive object, then the evolution rule is in the form 
a — >■ [i v]i or ca — >■ c[i v]i, where c G C,a € (V — C), u is a multiset 
of objects over (V — C). The rule of the form a — v]i means that 
the object identified by a is transformed into the objects identified by v, 
surrounded by a new membrane having the label i. No rule of the form 
a — >■ [o v]o can appear in any set Ri. During a computation the number 
of membranes can increase or decrease. 

The membrane structure and the multisets in II constitute the initial configu- 
ration of the system. We can pass from a configuration to another one by using 
the evolution rules. This is done in parallel: all objects, from all membranes, 
which can be the subject of local evolution rules, should evolve simultaneously. 
A rule can be used only if there are objects which are free at the moment when 
we check its applicability. 

The application of a rule ca — >■ cu in a region containing a multiset w means 
to remove a copy of the object a in the presence of c (catalyst), providing that 
such copies exist, then follow the prescriptions given by v: If an object appears 
in V in the form (a, here), then it remains in the same region; if it appears in the 
form (a, out), then a copy of the object a will be introduced in the region of the 
membrane placed outside the region of the rule ca — >■ cv; if it appears in the form 
{a, ini), then a copy of a is introduced in the membrane with index i, if such a 
membrane exist inside the current membrane, otherwise the rule cannot be ap- 
plied. If the special symbol t appears, then the thickness of the membrane which 
delimits the region where we work is increased by 1 . Initially, all membranes have 
the thickness 1. If a rule in a membrane of thickness 1 introduces the symbol r, 
then the thickness of the membrane becomes 2. A membrane of thickness 2 does 
not become thicker by using further rules which introduce the symbol t, but no 
object can enter or exit it. If a rule which introduces the symbol 5 is used in 
a membrane (including the skin membrane) of thickness 1, then the membrane 
is dissolved; if the membrane had thickness 2, then it returns to thickness 1. 
Whenever the skin membrane is dissolved, the whole membrane system will be 
destroyed. If at the same step one uses rules which introduce both 5 and r in the 
same membrane, then the membrane does not change its thickness. No object 
can be communicated through a membrane of thickness two, hence rules which 
introduce commands out and inj requesting such communications, can not be 
used. However, the communication has priority over changing the thickness: if at 
the same step an object should be communicated and a rule introduces the action 
T, then the object is communicated and “after that” the membrane changes the 
thickness. When applying a rule a — >■ [iv]i in a region j, a copy of a is removed 
and a membrane with the label i is created, containing the multiset v, inside the 
region of membrane j. We can never create a skin membrane. Similarly, when 
applying a rule ca — >■ c[iv]i in a region j, a copy of a, in presence of c (catalyst), 
is removed and a membrane with the label i is created, containing the multiset 
V, inside the region of membrane j. 
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A sequence of transitions between configurations of a given P system II is called 
a computation with respect to II. A computation is successful if and only if it 
halts, that is, there is no rule applicable to the objects present in the last con- 
figuration. The result of a successful computation is I^riw), where w describes 
the multiset of objects from T which have left the skin membrane during the 
computation. The set of such vectors I't{w) is denoted by Ps(77) (from “Parikh 
set”) and we say that it is generated by 77. (Note that we take into account only 
the objects from T.) 

The family of all sets of vectors of natural numbers Ps(7T) generated by a P 
system with membrane creation of degree (m, n),n > m > 1, with catalysts and 
the actions of both S, t, using the target indications of the form here, out, irij, is 
denoted by NPMC(^rn,n) {Cat, tar, S, r); when one of the features a £ {Cat, 6, r} 
is not present, we replace it with na. When the number of membranes is not 
bounded, we replace the subscript {m,n) with (*,*). 



5 Solving the Hamiltonian Path Problem 
for Undirected Graphs 

Given an undirected graph G= ( U, E) with n{> 2) nodes, where U is the set of 
nodes and E, the set of edges. The Hamiltonian path problem is to determine 
whether or not there exist an Hamiltonian path in G, that is, to determine 
whether or not there exist a path that passes through all the nodes in U exactly 
once. The Hamiltonian Path Problem for undirected graphs is known to be NP- 
Complete. It is possible to solve this problem, in a time which is linear in the 
number of nodes of a graph, by using a P system with membrane creation. 

Theorem 1. The Hamiltonian Path Problem for undirected graphs can be sol- 
ved, in a time which is linear in the number of nodes of a graph, by using a P 
system with membrane creation. 

Proof. Let G = {U, E) he an undirected graph with n(> 2) nodes. Let U = 
{oi, 02 , • • • , a„}. We now construct a P system with membrane creation of degree 
(l,n-|-l), NPMC(^i^n+i){nCat,tar,5,T), as 

77 = (U T,C,pl, Wo, 7?o,7?i,---,7?„), 

where V = {ai, a{, fi, f', d^, d{, c{l 1 < i < n, 0 < j < n}U{F}U{U| 0 < z < 3nj; 
(here only Ui are productive objects) 

T={Yj; 

C = cb-, 
h = [o]o; 

Wo = [to,ai,a 2 , - ■ ■ ,a„}; 

The set Ro contains the following rules: 

1. U — >■ U+i, 0 < z < 3n — 1; 

2 . ton — >■ S; 

3. c7 — >■ A, 1 < j < rz — 1, Vz; 
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4. c” — >■ {Y,out),'iv, 

5. di y [ifi 5 5 ‘ * j ^jk ]i'> 1 ^ 

(di will create a new membrane with label i and transform into a multiset 
of objects f'iC^dj-^ • ■ ■ dj^, where ji, • • • ,jk are vertices adjacent to vertex i) 

The set Ri,l < i < n, contains the following rules: 

1. dj -)> [jfj,d'j,d'j ^ , • • • , a' Jj, 1 < j < n; 

(dj will create a new membrane with label j and transform into a multiset 
of objects f'jd'jd'j^ ■ ■ ■ a'^, where ji, ■ ■ ■ ,jk are vertices adjacent to vertex j) 

2 . d'^ — >■ di', 

3- f[ ifi, injJifi, iuj^) ■ ■ ■ ifi, iuj^y, 

(here ji, • • • ,jk are vertices adjacent to vertex i) 

4- fj ifj , irij, ){fj, )•••(/,-, inj ^ ) , Vj yf b 

(here ji, • • • ,jk are vertices adjacent to vertex i) 

5. o' — >■ dj] 

6. di c°; 

7- fi ^ Ar; 

8. — >■ , out) , k > 0; 

AT ( rr\ — {A"™, if the given graph has ’m’ Hamiltonian paths.} 
(i.n+i)( otherwise. 

The system works as follows: Initially the skin membrane is having the pro- 
ductive objects Oi, • • • , a„. In the skin membrane, each productive object di, by 
using the rule di — >■ , • • • , dj^.]i, will create a membrane with index i and 

transform into a multiset of objects /', c°, ajj , • • • , dj^. . In membrane i, a produc- 
tive object dj will create a membrane with index j and transform into a multiset 
of objects f'jd'jdy ■ ■ ■ d'j^. In membrane i, fj will be replaced with fj, d'j will be 
replaced with dj, and a' with dj. In the next step, dj will be replaced with c° 
and dj will create a membrane by using the rule dj — >■ [jfjd'jdy ■ ■ ■ djjj and the 
process defined above will be repeated. Whenever a new membrane is created, 
all fj’s and /(’s, from its parent membrane, will move into that membrane. The 
role of fj is to increase the thickness of the membrane j, so that no object will be 
sent out from that membrane. This will eliminate the effect of the path, which 
contains multiple copies of the same node, on the result. For every rewriting 
step, Cj will be rewritten as and sent out of the membrane i provided that 
the thickness of the membrane i is 1. In the skin membrane c" (indicates that 
there exist a tour of path length n) will be replaced with Y and sent out. We 
use a counter ti in the skin membrane, so that for every rewriting step it will 
be incremented and whenever the counter reaches t^n, indicates the end of the 
computation, it will dissolve the whole membrane system by dissolving the skin 
membrane. So, if there exist a tour of length n, by traversing all nodes in the 
graph, then we can get Y from the system. 

Time Complexity: 

— This algorithm takes 3n -I- 1 steps for generating the output. 
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Note:- HPP problem can also be solved using P systems with aetive membranes 
na in 3n + 1 steps but it requires the rules for division of membranes. □ 

Example 1. Checking whether or not there exist an Hamiltonian Path in a given 
undirected graph. 

Sol: Let us consider the graph G = { U, E), with U = {ai, 02 , 03 } as shown 
below. We now construct a P system with membrane creation of degree (1,4) as 




n = {V,T,C,n,wo,Ro:--- ,R3), 

where 

V = <i <3,0 < j < 3}U {ti|0 < i < 9} U {¥}; 

T={Y}; 

C = </>; 

M = [o]o; 

Wo = {to, oi, • • • , 03 }; 

and Ri,0 < i < 3, as shown in the algorithm. 

1. [otoOia2a3]o; 



2. [o^l[l/lCl02]l[2/2C20ia3]2[3/3C3a2]3]o; 



3. [oi24c^C^[i/([2/2d2a'iay2]l[2/2[l/('^ia2]l[3/34a'2]3]2[3/3[2/2C^2aia'3]2]3]o 



4 . [0^3(1 [2/l/2'^2ai03]2] lb [1/1/2^102] l[ 3 / 2 / 3 d 3 « 2 ] 3]2 [3 [2/2/3<^2ai03]2]3]o; 



5- [o^4[i[2/i/2C2[i/i'^i02]i[3/3'^3®2]3]2]i 

[2[l/(/2C?b/2'^2aia3]2]l[3/2/3C3b/2C^2a'ia3]2]3]2 

[3b/2/3C2[l/l'^l«2]l[3/3'^3®2]3]2]3]o; 

6. [o^5[lC2b[l/l/2/Wia2]l[3/l/2/3d3a2]3]2]l 

[2C1 03)1 [2/1/2/2^201 03)2] 1(3 b/ 2 / 3 / 2 ^ 2 aia 3 ] 2 ] 3]2 
[3C2 [2 [1/2/3/1^102] l[ 3 / 2 / 3 / 3 '^ 3 « 2 ] 3 ] 2 ] 3 ]o; 
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7. [0t6Ct4clci[i[2{lf2fic°i[2f2dWia3]2}l[3flf2f34[2f2dWia3h]3h]l 

[2[l{2/l/2C2[l/Wiayi[3/3^3®y3}2]l[3{2/3/2C2[l/l'^l02]l[3/3'^3®2]3}2]3]2 
[3[2[l/2/3/lC?[2/2C^2aia3]2]l{3/2/3C3 [2/2C?2aiOy 2}3]2]3]o; 



8- [o^7[l[2C3{lC3[2/2/l/2C^2aia3]2}l[3[2/l/2/3/2'^2aia3]2]3]2]l 

[2 [l {2C2 [l /l /2/W1 02] 1 [3/1 72/3^3^2] 3}2]l [s{2C2[i 73/2 /lC^ia2]l 

[3/3/2/3<i3a2]3}2]3]2 

[3[2Ci[l[2/2/3/l/2'^2aia3]2]l{3C3[2/2/3/2'^2aiO3]2}3]2]3]0; 



9. [0^8[lC3[2{lC?{2/l/2C2[l/^<a'2]i[3/3d'3a'2]3}2}l[3{2/l/3/2C2[l/(<a2 



2 JI 



[3/3C^3®y3}2]3]2]l 
[2 [l{2C2{l/2/lCi[2/2rf2«l«3]2}l [3/1/2/3C3 [2/2(^201 a3]2]3}2]l 

[3{2C2[l/3/2/lCi[2/2<^2aia3]2]l{3/2/3C3 [2/2l^2a'ia3]2}3}2]3]2 

[3cf[2[l{2/3/2C2[l/WW2]l[3/34a2]3}2]l{3C3{2/3/2C2[l/Wia'2] 

[3/3<^3a2]3}2}3]2]3]o; 



10 . [o^gcf C3 [1 [2 {1C3 {2C2 [1 /l /2/idl 02] 1 [3/1 72/3^3 02]3}2 }l [3(202 [1/1 /3/2/Wl02]l 
[37i737273'^302]3}2]3]2]i 

[2[l{2C2C3{lC3[2727l72'^2aia3]2}l[3[27l727372'^2aia3]2]3}2]l 
[3{2C2c[[[l[273727l72'^2aia3]2]l{3C3 [27273720^2aia3]2}3}2]3]2 
[3 [2 [1 {2C2 [1 73727 i 0^102] 1 [3737273 ^302] 3 (2] i{3C3{2C2[i73 72 7(0^102] 1 
[3737273'^302]3}2}3]2]3]o; 



11. y2. 

Y"^ indicates that there exist 2 Hamiltonian paths in the given graph. Here 
oi — 02 — 03 and 03 — 02 — oi are the two paths. 

Note Here we used different notation ({ }) for membranes of thickness 2. 



6 Computational Universality 

We now investigate the computational power of a P system with membrane 
creation. Here in theorem 2 we show that a P system with membrane creation 
of degree (1,4) with catalysts and with the actions of r and 6 generate PsRE. 

Theorem 2. PsRE = NPMC(i_ 4 '){Cat,tar,T, S) . 

Proof. We prove only the inclusion PsRE C N PMC(i^i){Cat,tar,T,5), the re- 
verse inclusion can be proved in a straight forward manner. Let us consider a 
matrix grammar with appearance checking, G = (N, T, S, M, F ) , in the strong 
binary normal form, that is with fV = fVi U N 2 U {S', f}, with rules of the four 
forms mentioned in Section 3, and with ac{G) < 2. Assume that we are in the 
worst case, with ac{G) = 2, and let and be the two symbols in N 2 for 
which we have rules B^^'> — >■ f in matrices of M. Let us assume that we have h 
matrices of the form m' : (A — >• — >• f), A, F G Ai,j G {1,2}, 1 < i < h, 

and k matrices of the form rui : (A — >• a, A — >• x), A G Ai, A G A 2 , a G Ai U{A}, 
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and X G {N 2 U T)* , 1 < i < k. Each matrix of the form {X X, A ^ x),X G 
Ni,A G N 2 ,x G T* , is replaced by {X ^ f, A ^ x), where / is a new symbol. 
We continue to label the obtained matrix in the same way as the original one. 
The matrices of the form {X — Y, — >■ f), X, F G Ni are labeled by to', with 
i G labj, for j G {1, 2}, such that lab\, lab 2 and lah^ = {1, 2, • • • , fc} are mutually 
disjoint sets. 

We construct a P system with membrane creation as 

n = (V, T, C, /i, Wo, Rq, i?i, i? 2 , R 3 ), 

with the following components : 

U = iVi U IV 2 U {X', X,\ X G Ni,l<i<k} 

A{X' \ X G Ni,l <i<h} 

U{A,\AGN2,l<i<k} 

C = {c}; 

M = [o]o; 

Wq = {c, X, yl}, for (S' — >■ Xyl) being the initial matrix of G; 
and the sets Rq, i?i, i ?2 and S 3 contains the following rules: 

— Rq : 

1 . X^i^EX'l; 

2. X^ for to' : (X ^ F, ^ f),j G {1,2}; 

3. cA — >■ c{Ai,ino); 

4. b(A g{1,2}-, 

5. a — >■ (a, out); 

6 . / — >■ A; 

7. Z-)-X; 

8 . A — y A, A G N 2 ; 

— Si : 

1. S ^ At; 

2. X' -G Xi; 

3. Xi — >■ Xi+i, 1 < * < (fc — 1); 

4. Ai — >■ Ai^i, 1 < i < (A: — 1); 

5. Xi ^ ZYS, for mi : {X ^ Y, A ^ x),Y G NiU {/}; 

6 . Ai — >■ ZxS, for mi : {X ^ Y, A ^ x), F G fVi U {/}; 

7. F ^ t; 

8 . Xk — >■ ti 

9. Ak — >■ ti 
10. t ti 

— Rj+ 1 , where j G (1, 2}, such that to' : (X — >• F, -j-j : 

1. X' ^ F; 

2 . Y^YS; 

3. t ti 
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The system works as follows: 

Initially the skin membrane contains the objects X, A and a catalyst c. Here X 
is a productive object and creates a membrane. In order to simulate a matrix 
of type 2 or 4, the object X will create a membrane with index 0. Whenever a 
membrane with index 0 appears, with the help of catalyst c, a non-productive 
object A G N 2 will move into the membrane 0 as Hi. At the same time, X' will 
transform into Xi and the thickness of the membrane 0 will be increased to 2 by 
using the rule if — >■ Ar (this is to prevent the duplicate copies of A to come in). 
Once we have Xi and Ai in membrane 0, the subscripts of X and A are increased 
step by step. At any time we can apply the rules Xi — >• ZYS, Y G NiU {/} and 
Ai -G Zx6. But if we apply one of these rules first, then a trap symbol will be 
introduced by the rule Z — >■ f. So, in order to get the correct simulation, we 
have to apply these two rules at the same step. In this way we can complete 
the simulation of a matrix nii of type 2 or 4. If the symbol A in membrane 0 
is not the one in the matrix nii : {X -G Y, A -G x), then we cannot apply the 
rule Ai -G ZxS and a trap symbol will be introduced by the rule Z — >■ f. After 
simulating a matrix rrii : (X -G f, A ^ x) oi type 4, if there is any nonterminal 
A G N 2 present in the skin membrane, then the computation never halts due 
to the aplication of the rule A — >■ A. The symbol / will be erased in the skin 
membrane. 

For simulating a matrix of type 3, X will create a membrane with index (j -|- 
l),j G {1,2}. If there is a symbol present in the skin membrane, then it 
will be replaced with f and enter into the membrane (j -I- 1) and so that the 
computation never halts. Otherwise, in membrane (j -I- 1), the symbol A' will 
be replaced with Y and in the next step, this Y will be sent out by using the 
rule Y — >■ In this way we can complete the simulation of a matrix m'i. Thus, 
the equality <Ft(T(G)) = APMG(i^)(Gat, tar, r, d) follows. □ 

7 Conclusion 

As a variant of P systems, we have introduced a new variant of P systems, P 
systems with membrane creation, in which some objects are productive and create 
membranes. This new variant of P systems is capable of solving the Hamiltonian 
Path Problem in linear time. We showed that P systems with membrane creation 
are computationally complete. 
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Abstract. We introduce a continuous-space model of computation. This 
original model is inspired by the theory of Fourier optics. We show a lower 
bound on the computational power of this model by Type-2 machine sim- 
ulation. The limit on computational power of our model is nontrivial. We 
define a problem solvable with our model that is not Type-2 computable. 
The theory of optics does not preclude a physical implementation of our 
model. 



1 Introduction 

In this paper we introduce to the theoretical computer science community an 
original continuous-space model of computation. The model was developed for 
the analysis of (analog) Fourier optical computing architectures and algorithms, 
specifically pattern recognition and matrix algebra processors Pj. The function- 
ality of the model is limited to operations routinely performed by optical sci- 
entists thus ensuring it is implementable within this physical theory [ID] . The 
model uses a finite number of two dimensional (2-D) images of finite size and 
infinite resolution for data storage. It can navigate, copy, and perform other op- 
tical operations on its images. A useful analogy would be to describe the model 
as a random access machine, without conditional branching and with registers 
that hold images. This model has previously mu been shown to be at least 
as computationally powerful as a universal Turing machine (TM). However, its 
exact computational power has not yet been characterised. To demonstrate a 
lower bound on computational power we simulate a Type-2 machine. The upper 
bound is not obvious; the model can decide at least one language that a Type-2 
machine can not. This combination of super-Turing power and possible imple- 
mentation strongly motivates investigation of the model. In Sect.|21 we introduce 
the optical model of computation. In Sect. 0 we outline some relevant points 
from Type-2 Theory of Effectivity, and present our working view of Type-2 ma- 
chines. In Sect.0 we present our simulation of a Type-2 machine. We finish with 
a discussion of its super- Turing power and a conclusion (Sects. 0 and 0. 
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(a) 



(b) 



Fig. 1. Schematics of (a) the grid memory structure of our model of computation, 
showing example locations for the ‘well-known’ addresses a, b and sta, and (6) loading 
(and automatically rescaling) a subset of the grid into grid element a. The program 
|ld| 2 I 3 I 1 I 3 I . . . \hU{ instructs the machine to load into default location a the portion 
of the grid addressed by columns 2 through 3 and rows 1 through 3. 



2 The Optical Computational Model 

Each instance of our machine consists of a memory containing a program (an 
ordered list of operations) and an input. The memory structure is in the form of 
a 2-D grid of rectangular elements, as shown in Fig.Q(o). The grid has finite size 
and a scheme to address each element uniquely. Each grid element holds a 2-D 
infinite resolution comp lex- valued image. Three of these images are addressed 
by the identifiers a, b, and sta (two global storage locations and a program start 
location, respectively). 

The two most basic operations available to the programmer, Id and st (both 
parameterised by two column addresses and two row addresses), copy rectan- 
gular m X n (m,n G N, m,n > 1) subsets of the grid into and out of image a, 
respectively. Upon such loading and storing the image information is rescaled to 
the full extent of the target location [as depicted in Fig. n[&)]. Two additional 
real- valued parameters ziower and Zuppen specifying lower and upper cut-off val- 
ues, filter the rectangle’s contents by amplitude before rescaling, 

{ Dower ■ ^ Dower 

■^upper : Re[/(i,j)]> ■^upper ■ 

■ otherwise 

For the purposes of this paper we do not require the use of amplitude filtering 
and use an all-pass filter represented by the rationale 0/1 and 1/1 (we make use 
of the symbol ‘/’ for this purpose). The complete set of atomic operations is 
given in Fig. 0 

Each instance of our machine is a quintuple M = (D, L, Z, /, P), in which 
• D = {x,y) , x,y gN : grid dimensions 

• L = (ox, fly, 6xj by, Sxj Sy) , c, 6, s S N : locations of a, b, and sta 
• Z = (zmin, 2max, c) , z G C, r G Q : global amplitude bounds and ampli- 
tude resolution of grid elements 
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Id|cl|c2|rl|r2pi|2:u : cl,c2,rl,r2 G N; zi,Zu € Q; copy into a the rectangle 
defined by the coordinates (cl, rl) and (c2, r2). (zi,Zu) is 
the amplitude filter [we use (0/0, 1/1) everywhere for the 
purposes of this paper). 

: cl, c2, rl, r2 G N; zi, Zu G Q; copy the image in a into the 
rectangle defined by the coordinates (cl, rl) and (c2, r2). 

: perform a horizontal 1-D Fourier transform on the 2-D 
image in a. Store result in a. 

pv~] : perform a vertical 1-D Fourier transform on the 2-D image 
in a. Store result in a. 

: multiply (point by point) the two images in a and b. Store 
result in a. 

pT] : perform a complex addition of a and b. Store result in a. 
pp : replace a with the complex conjugate of a. 
br|cl|rl] : cl,rl G N; unconditionally branch to the instruction at 
the image with coordinates (cl, rl). 

[hit : halt. 



st|cl|c2|rl|r2pi|2:u 

E 



Fig. 2. The set of atomic operations permitted in the model. 



• / = [(iix, *iy, ipi) ,■■■ , (inx, iny, V’n)] j * G N, ijj G Image : the n inputs and 
their locations, where Image is a complex surface bounded by 0 and 1 in 
both spatial directions and with values limited by Z 

• P = [(pix,Ply,7ri) , . . . ,{Pmx,Pmy,T^m)] , P G N, 7T G {Id, St, h, V, *, •, -|-, 
br, hit, /, N} C Image : the m programming symbols, for a given instance of 
the machine, and each of their locations. N G Image represents an arbitrary 
row or column address. 

As might be expected for an analog processor, its programming language 
does not support comparison of arbitrary image values. Fortunately, not having 
such a comparison operator will not impede us from simulating a branching 
instruction (see Sect. HI). In addition, address resolution is possible since (?) our 
set of possible image addresses is finite (each memory grid has a fixed size) , and 
(a) we anticipate no false positives (we will never seek an address not from this 
finite set). 

3 Type-2 Theory of Effectivity 

Standard computability theory |B| describes a set of functions that map from 
one countably infinite set of finite symbol sequences to another. In “Type-2 
Theory of Effectivity” (TTE) |0|, ‘ computation’ refers to processing over infinite 
sequences of symbols, that is, infinite input sequences are mapped to infinite 
output sequences. If we use two or more symbols the set of such sequences 
is uncountable; TTE describes computation over uncountable sets and their 
subsets. The following is a definition of a Type-2 machine as taken from 
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Fig. 3. Our working view of a Type-2 machine: ( T) a halting TM; (lo) the output tape; 
(Yi) the input tape; {Y 2 ) the nonvolatile ‘work tape’. Controls A, B, and C represent 
the functionality to read from Yi, write to Yq, and read/ write to Y 2 , respectively. 



Definition 1. A Type-2 machine M is a TM with k input tapes together 
with a type specification (Yi, . . . , Y^, Yg) with Yi € {S* ^ giving the 
type for each input tape and the output tape. 

In this definition, 27 is a finite alphabet of two or more symbols, 27* is the set of 
all finite length sequences over 27, 27*^ is the set of all infinite length sequences 
over 27. There are two possible input /output tape types, one holds sequences 
from 27* and the other from 27*^ . 

Input tapes are one-way read only and the output tape is one-way write only. 
If the output tape restriction was not in place any part of an infinite output would 
not be guaranteed to be fixed as it could possibly be overwritten during a future 
computation step. Hence, finite outputs from Type-2 computations are useful 
for approximation or in the simulation of possibly infinite processes. A Type- 
2 machine either finishes its computation in finite time with a finite number 
of symbols on its output tape, or computes forever writing an infinite sequence. 
Machines that compute forever while outputting only a finite number of symbols 
are undefined in Type-2 theory 0. 

3.1 A New View of Type-2 Computations 

We maintain that a Type-2 machine can be viewed as a repeatedly instantiated 
halting TM that has an additional (read-only) input tape Y\ and an additional 
(write-only) output tape Yg. (Without loss of generality, the finite number of 
input tapes from Def. Ecan be mapped to a single tape.) A nonvolatile ‘work 
tape’ Y 2 is used to store symbols between repeated instantiations (runs) of the 
halting TM. This is illustrated in Fig. 0 T is the halting TM. Control A repre- 
sents the functionality to read from Yi. Control B represents the functionality 
to write to Yg. Control C represents the functionality to write to and read from 
Y2. 
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A Type-2 computation will proceed as follows. The halting TM T is instan- 
tiated at its initial state with blank internal tape(s). It reads a symbol from Yi. 
Combining this with symbols (if any) left on Y 2 by the previous instantiations, 
it can, if required, write symbols on I 2 and Yq. T then halts, is instantiated once 
more with blank internal tape(s), and the iteration continues. The computation 
will either terminate with a finite sequence of symbols on the output tape or 
compute forever writing out an infinite sequence. In this light, an infinite Type- 
2 machine computation corresponds to an infinite sequence of instantiations of 
a single halting TM (plus extra computation steps for controls A, B, and C). 

4 Simulation 

We use simulation as a technique to measure computational power. If we can 
show that machine Mq can simulate every operation that machine Mi performs, 
we can say that Mg is at least as powerful as M\. Universality for our machine 
has already been proved 13 0 following Minsky’s arithmetisation of TMs 0 
(representing a TM in terms of quadruples of integers). Four images were used 
to represent Minsky’s four registers. In this paper our TM simulation is more 
efficient, reducing the number of required commands from 52 to 17. We refine 
this TM simulation into a Type-2 machine simulation. Although straightforward, 
these simulations are technically nontrivial. We were required to overcome the 
restriction of no conditional branching in our model and to define an appropriate 
view of Type-2 computations. 

In general, a TM could be simulated by a look-up table and the two stacks 
m and n, as shown in Fig. 0 A given TM [such as that in Fig. 0a)] is written 
in the imperative form illustrated in Fig. mb), where the simulation of state 
changes and TM tape head movements can be achieved with two stacks and two 
variables as shown in Fig. 0c). 

In order to simulate a stack we previously effected indirect addressing with a 
combination of program self-modification and direct addressing. We also simu- 
lated conditional branching by combining indirect addressing and unconditional 
branching 00. This was based on a technique by Rojas 0 that relied on the 
fact that our set of symbols is finite. Without loss of generality, in our simulation 
we will restrict ourselves to three possible symbols, ‘O’, ‘1’ and a blank symbol 
‘b’. Then, the conditional branching instruction “if (a=‘l’) then jump to address 
X, else jump to U” is written as the unconditional branching instruction “jump 
to address a”. We are required only to ensure that the code corresponding to 
addresses X and Y is always at addresses ‘1’ and ‘O’, respectively (and that 
we take into account the case where a =‘b’). In a 2-D memory (with an extra 
addressing coordinate) many such branching instructions are possible. 

4.1 Shorthand Conventions 

To facilitate persons reading and writing programs, a shorthand notation is used 
(see Fig.0. In this shorthand, instead of having to specify exact addresses, we 
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g 


s 


s 


d 


g 


0 




T 


R 


1 


0 


1 


0 


R 


1 


0 


b 


b 


R 


2 


1 


b 


b 


L 


3 



0: initial state 
1: moving left 
2: rejecting halt 
3: accepting halt 



q := initial_state ; 
halt := false; 
while (halt = false) 


{ 


select 


case (q, s) 


{ 


(0,0) 


: fn(l,R,l) 




(0,1) 


: fn(0,R,l) 




(0,b) 


: fn(b,R,2) 




(l,b) 


: fn(b,L,3) 




else : 


halt : = true ; 


} 






} 







void fn(s’,d,q’) { 
if(d = R) { 
push(m, s ’ ) ; 
s := pop(n) ; 

} else { 

push(n, s ’ ) ; 
s := pop(m) ; 

} 

q := q>; 

} 



(a) 



(&) 



(c) 



Fig. 4. Figure showing (a) an example TM table of behaviour. This machine flips the 
binary value at its tape head and halts in an accepting state. If there is a blank at 
its tape head it halts in a rejecting state; (&) an illustration of how an arbitrary TM 
table of behaviour might be simulated with pseudocode; (c) how one might effect a 
TM computation step with stacks m and n. 



give images a temporary name (such as ‘xl’) and refer to the address of that 

image with the character. So, when the programmer writes 

(s)he would intend 5 hMi3 | i3 | ^ I ^ I g I / I M M / ' ' J "^^ere the blank 

or undefined image at coordinates (13,5) will be overwritten with the contents 
of a before the statement to which it belongs is executed. Expansion from this 
shorthand to the long-form programming language is a mechanical procedure 
that could be performed as a ‘tidying-up’ phase by the programmer or by a pre- 
processor. Unless otherwise stated, we assume that the global bounds on image 
amplitude values are ^min = 0 and 2m ax = 1 - The load and store operations 
contain 0/1 (= 0) and 1/1 (= 1) for their 2iower and Zupper parameters, respec- 
tively, indicating that the complete image is to be accessed. As a convention we 
use underlining in program grid elements whose images can be modified by the 
machine and italics to highlight points of TM termination within the grid. 

4.2 Push and Pop Routines 

Images can be stacked using a stepwise rescaling technique. Take an empty 
image, representing an empty stack, and an image i to be pushed onto the 
stack. Place both side-by-side with i to the left and rescale both into a single 
grid element. This could be regarded as an image stack with one element. Take 
another image, j, place it to the left of the stack and rescale both into a single 
grid element once again. This single image is a stack with two elements. A pop 
operation would involve stretching this stack over two grid elements. The left- 
hand image will then contain a single image (j in this case) and the right-hand 
image will contain the remainder of the stack. The stack can be repeatedly 
rescaled over two images popping a single image each time. 
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An implementation of such a stack system in our model requires indirectly 
addressing the stack and employing a third image (in addition to a and b) named 
c. The low-level details are as follows. In advance of the push operation we ensure 
that the column number of the stack is stored in a and the element to be pushed 
is stored in c. (The column number is sufficient as all stacks will be located on 
row 99 in Fig. 0) The push routine begins by copying the contents of the stack 
into b and moving c into a. Then both a and b are rescaled into a and stored 
back in the stack’s original location. In advance of a pop operation we ensure 
that the column number of the stack in question will be stored in a. The pop 
routine begins by loading the stack into a and rescaling it over both a and b. 
Image a now contains the top image and b contains the remainder of the stack. 
The top element is temporarily stored in c, the contents of the stack stored back 
in its original location, and the popped element returned to a before the routine 
ends. 

We use these routines to simulate the movement of the TM tape head as 
illustrated in Fig. Efc). The operational semantics of push and pop (including 
the use of self-modification to load the contents of a stack into a) can be found 
in rows 7 and 8 of Fig. 0 

4.3 Type-2 Machine Simulation 

An arbitrary Type-2 machine is incorporated into our simulation as follows. 
Firstly, transform the Type-2 machine into a Type-2 machine that operates over 
our alphabet. Then rewrite the machine to conform to the form shown in Fig. El 
For the purposes of this simulation we represent Y 2 with TM T’s internal tape 
(essentially using the semi-infinite tape to the left of the tape head). When T 
halts it will either be in an accepting or rejecting state. T’s accepting state 
is equivalent to the simulator’s initial state (i.e. T passes control back to the 
simulator when it halts). At the simulator’s initial state it checks if T’s tape 
head was at a non-blank symbol when T halted. If so, it writes that symbol to 
lo- All symbols to the left of the tape head (essentially the contents of I 2 ) will 
be retained for the next instantiation of T. Next, the simulator reads a symbol 
from Yi and writes it on T’s tape in the cell being scanned by T’s tape head. 
It then passes control to T, by going into T’s initial state. If at any time T 
halts in a rejecting state we branch to the simulator’s halt state. In Fig. El we 
simulate a specific example of a Type-2 machine that flips the bits of its binary 
input. If the input is an infinite sequence it computes forever, writing out an 
infinite sequence of flipped bits. If the input is finite it outputs a finite sequence 
of flipped bits. 

4.4 Explanation of Figs. O and ini 

The Type-2 simulation by our model is shown in Fig. 0 It consists of two parts 
(separated in the diagram for clarity). The larger is the simulator (consisting of 
a universal TM, functionality A, B, and C from Fig.Q and stacks Y\ and Fq)- 
A TM table of behaviour must be inserted into this simulator [the example TM 
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is a universal Type-2 machine simulator and the smaller is its halting TM table of behaviour. [The example TM we use 
here is that in Fig. 0[a).j The simulator is written in a compact shorthand notation. The expansions into sequences of 
atomic operations are shown in Fig. Eland the simulation is explained in Sect. in 
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Fig. 6. Time-saving shorthand conventions used in the simulator in Fig. El These are 
explained in Sect. 
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is from Fig. Ua)]. It has a straightforward encoding. Notice how for each row of 
the table of behaviour {q, s, s' , d, q') an ordered triple {s',d,q') is placed at the 
location addressed by coordinates (q,s). 

Row 99 of Fig.|3is the ‘well-known’ row containing stacks, constants and the 
locations a, b, c, and sta. To permit indirect addressing, locations m, n, Yi, 
and Yq store the column numbers of their respective stacks. Yi and Yq represent 
the one-way tapes from Fig. 0 (we pop from Yi and push to Iq)- The stack 
m encodes all symbols on T’s tape to the left of tape head, and the stack n 
encodes all symbols on T’s tape to the right of the tape head. Image s encodes 
the symbol currently being scanned by T’s tape head. The blank constant ‘b’ is 
represented by ‘2’. Before execution begins, an input will have been encoded in 
stack Yi, grid element (6,99), in place of symbol ‘?’. Control flow begins at the 
program start location sta, grid element (17, 99), and proceeds rightwards until 
one of br or hit is encountered. 

In rows 7 and 8 we have the push and pop routines; these are explained in 
Sect. 14.21 Rows 5 and 6, named mvl and mvr respectively, contain commands to 
simulate moving left and right on T’s tape. This is effected by a combination of 
pushes and pops to the stacks m and n and image s that encode T’s tape sym- 
bols. The ‘ret’ command returns execution to the point from which a subroutine 
mvl, mvr, psh, or pop was called. Rows 3 and 4 describe what is to happen 
when the inserted TM halts in a rejecting or accepting state, respectively. In the 
former case the Type-2 machine simulation halts. In the latter case, the TM is 
reinstantiated in accordance with the description in Sect. ft. II If the TM’s tape 
head had written ‘0’ or ‘1’ as it halted, control flow then moves to row 0 or 1, 
respectively, and the appropriate symbol is written to Yq before instantiation. 
If the tape head writes a ‘2’ (a blank) as it halted, control flow moves to row 2 
and the TM is directly reinstantiated. 

For our particular example inserted TM [from Fig. 0(a)] if it reads a ‘0’ or 
‘1’ it halts in an accepting state and control moves to the beginning of row 4. If 
the input to our simulation is finite, an instantiation will eventually read a blank 
symbol, will enter state ‘q2’, and the Type-2 simulation will halt. Otherwise, it 
will be repeatedly instantiated, each time flipping one bit. 

The simulation is written in a shorthand notation (including shorthand ver- 
sions of Id, st, and br from Fig. E| which is expanded using Fig. El Figure El[ a) 
shows the expansion of shorthand notation used in setting up calls to the psh 
and pop routines (loading the appropriate stack address into a in advance of a 
pop, and storing into c the symbol to be pushed and loading the stack address 
into a in advance of a push). In advance of a psh the element to be pushed 
will be in a. After a pop, the popped element will be in a. Figure EI[&) shows 
commands for branching to an address where the row is specified by the symbol 
currently scanned by T’s tape head. Figure EJc) shows routines for simulating 
the execution of a row of T’s table of behaviour. Commands for loading from 
and storing to locations specified at runtime, and to/from the ‘well-known’ loca- 
tions on row 99 are in Fig. Eld). Figure El e) shows the commands for branching 
to subroutines. Finally, Fig. El/) illustrates how all labels are eventually given 
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absolute addresses by a preprocessor. After a first pass of the preprocessor (ex- 
panding the shorthand) the modifiable (underlined) references are updated with 
hardcoded addresses. 

5 Super- Turing Capabilities of Our Model 

Type-2 machines do not describe all of the computational capabilities of our 
model. The model’s atomic operations operate on a continuum of values in con- 
stant time (independent of input size) and would not have obvious TM or Type-2 
machine implementations. 

Consider the language L defined by its characteristic function / : — >■ {0, 1}, 

where 



f{p) ■■= 




otherwise 



and where p is an infinite sequence over alphabet {0,1}. This language is accept- 
able but not decidable by a Type-2 machine 0 (Ex. 2. 1.4. 6). In our model, we 
encode a boolean value in an image by letting a (^-function at its origin denote 
a ‘1’ and an empty image (or an image with low background noise) denote a 
‘O’. An infinite sequence of boolean-valued images could be presented as input 
concatenated together in one image without loss of information (by definition, 
images in our machine have infinite spatial resolution) . An off-centre peak can be 
centred for easy detection through Fourier transformation (using the shorthand 
program |id|Yi| h | v |st| b | * |~| ) . This uses the property that the term at the origin 
of a Fourier transform of an image, the dc term, has a value proportional to the 
energy over the entire image. Our model could therefore Fourier transform the 
continuous input image and then measure the value of the dc term in unit time. 
A peak would indicate that there is some energy (and therefore at least one ‘1’) 
somewhere in the image; the corresponding word is in L. An absence of a peak 
at the origin indicates that there is not a ‘1’ in the image; the corresponding 
word is not in L. 



6 Conclusion 

We introduce an original continuous-space model of computation to the com- 
puter science community. We show its relationship to existing theory by proving 
that it is at least as powerful as the Type-2 machine model. We are currently in- 
vestigating the relationship between our model and piecewise affine maps md. 
As the model remains faithful to the theory of physical optics we propose that it 
could serve as an implementation of Type-2 machines. Furthermore, the upper 
bound on the computational power of this model is nontrivial and deserves to 
be analysed. We present at least one problem, decidable by our model, that is 
undecidable by a Type-2 machine. The model’s computational power, combined 
with implementation possibilities, strongly motivates continued investigation. 
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Abstract. In this paper we show that the problem of the existence of a 
p-optimal proof system for SAT can be characterized in a similar manner 
as J. Hartmanis and L. Hemachandra characterized the problem of the 
existence of complete languages for UP. Namely, there exists a p-optimal 
proof system for SAT if and only if there is a suitable recursive presen- 
tation of the class of all easy (polynomial time recognizable) subsets of 
SAT. Using this characterization we prove that if there does not exist a 
p-optimal proof system for SAT, then for every theory T there exists an 
easy subset of SAT which is not T-provably easy. 



1 Introduction 

The question of the existence of a p-optimal proof system for SAT (the set of 
all satisfiable boolean formulas) was posed by J. Kobler and J. Messner in p|. 
This question, like the similar and older problem of the existence of a p-optimal 
proof system for TAUT (the set of all tautologies in Propositional Logic), is 
still open. The related problem of whether the natural proof system for SAT (a 
satisfying truth assignment is a proof of formula a) is p-optimal, was considered 
in different nomenclature by S. Fenner, L. Fortnow, A. Naik and J. Rogers in 0 
(see also 0)- 

A proof system for a language L is a polynomial time computable function 
whose range is L. This notion was defined by S. Cook and R. Reckhow in |21- To 
compare the efficiency of different proof systems, they considered the notion of 
p-simulation. Intuitively a proof system h p-simulates a second one g if there is 
a polynomial time computable function t translating proofs in g into proofs in h. 
A proof system is called p-optimal for L when it p-simulates any proof system 
for L. 

The existence of a p-optimal proof system for TAUT, historically the first 
question about optimal proof systems, was posed by J. Krajicek and P. Pudlak 
0 in 1989. Since then, research concerning optimal proof systems has acquired 
importance in Proof Complexity Theory and Structural Complexity Theory. 
J. Kobler and J. Messner p] proved that the existence of both p-optimal proof 
systems for SAT and for TAUT suffices to obtain a complete language for 

M. Margenstern and Y. Rogozhin (Eds.): MCU 2001, LNCS 2055, pp. 300-|^^] 2001. 
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NP n co-NP. This result generated interest in the problem of the existence 
of a p-optimal proof system for SAT . 

It is not currently known whether UP and other promise classes have com- 
plete languages. J. Hartmanis, and L. Hemachandra pointed out in ^ that UP 
possesses complete languages if and only if there is a recursive enumeration of 
polynomial time clocked Turing machines covering all languages from this class. 
Earlier, the question of whether NP fl co-NP possesses complete languages was 
related to an analogous statement (see |^). 

In this paper we show that the question of the existence of a p-optimal proof 
system for SAT can be characterized in a similar manner. We prove that a 
p-optimal proof system for SAT exists if and only if there is a recursive enumer- 
ation of polynomial time clocked Turing machines covering all easy (polynomial 
time recognizable) subsets of SAT. This means that the problems of the existence 
of complete languages for promise classes and the problem of the existence of a 
p-optimal proof system for SAT, though distant at first sight, are structurally 
similar. Since complete languages for promise classes have been unsuccesfully 
searched for in the past, our equivalence gives some evidence of the fact that 
p-optimal proof systems for SAT might not exist. 

2 Preliminaries 

We assume some familiarity with basic complexity theory, see The symbol 
E denotes, throughout the paper, a certain fixed finite alphabet. The set of all 
strings over S is denoted by S* . For a string x, |a;| denotes the length of x. The 
symbol FP denotes the class of functions that can be computed in polynomial 
time. 

We use Turing machines (acceptors and transducers) as our basic compu- 
tational model. We will not distinguish between a machine and its code. For 
a deterministic Turing machine M and an input w, TIME(M, w) denotes the 
computing time of M on w. 

We consider polynomial time clocked Turing machines with uniformly at- 
tached standard -\-k clocks which stop their computations in polynomial time 
(see m). We impose some restrictions on our encoding of these machines. From 
the code of any polynomial time clocked Turing machine, we can detect easily 
(in polynomial time) the natural k such that n^-|-fc is its polynomial time bound. 

The symbol TAUT denotes the set (of all encodings) of propositional tau- 
tologies; SAT denotes the set of all satisfiable boolean formulas. 

Finally, < > denotes some standard polynomial time computable 

tupling function. 

3 P-optimal Proof Systems 

and Almost Optimal Algorithms for SAT 

The systematic study of the efficiency of propositional proof systems (proof sys- 
tems for TAUT) was started by S. Cook and R. Reckhow in |2j. They introduced 
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the abstract notion of a proof system for TAUT and proved that a polynomially 
bounded proof system for TAUT exists if and only if NP=co-NP (see Enni)- 

The notion of a proof system can be considered not only for TAUT but also 
for any language L. 

Definition 1. (see m) A proof system for L is a polynomial time computable 
function h : E* L. 

If h{w) = X we may say that w is a proof of x in h. It follows from the above 
definition that for any proof system h there exists a polynomial time clocked 
transducer which computes h. 

To classify propositional proof systems by their relative efficiency S. Cook 
and R. Reckhow introduced in |5| the notion of p-simulation. For proof systems 
for SAT this notion can be defined as follows: 

Definition 2. (see m) Let h, h' be two proof systems for SAT. We say that h p- 
simulates h' if there exists a polynomial time computable function 7 : E* — > E* 
such that for every a € SAT and every w € E* , if w is a proof of a in h' , then 
j(w) is a proof of a in h. 



Definition 3. (see m) A proof system for SAT is p-optimal if it p-simulates 
any proof system for SAT. 

The notion of an almost optimal deterministic algorithm for TAUT was 
introduced by J. Krajicek and P. Pudlak [S|. We will consider an analogous 
notion of an almost optimal deterministic algorithm for SAT with Krajfcek- 
Pudlak’s optimality property. 

Definition 4. (cf. f / An almost optimal deterministic algorithm for SAT is 
a deterministic Turing machine M which recognizes SAT and such that for any 
deterministic Turing machine M' which recognizes SAT there exists a polynomial 
p such that for every satisfiable boolean formula a 

TIME(M; a) < p(lal,TIME(M'; a)) 

We name this algorithm as an almost optimal because the optimality condi- 
tion is stated only for any input string x which belongs to SAT and nothing is 
claimed for other x's (compare the definition of an optimal acceptor for SAT in 

0 )- 

The following result was proved in (see also 0). 

Theorem 1. Statements (i) - (ii) are equivalent. 

(i) There exists an almost optimal deterministic algorithm for SAT. 

(ii) There exists a p-optimal proof system for SAT. 
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4 P-optimal Proof System for SAT 

and the Structure of Easy Subsets of SAT 

The problems of the existence of complete languages for NP fl co-NP and UP 
are oracle dependent (see H2! and 0), and therefore they seem to be very diffi- 
cult. It was shown in jSj and 0 that these problems can be related to statements 
about the existence of recursive enumerations of polynomial time clocked Turing 
machines covering all languages from these classes. We will show in this section 
that the problem of the existence of a p-optimal proof system for SAT can also 
be related to an analogous statement. 

Definition 5. We say that a polynomial time clocked transducer M behaves well 
on input w if M on w outputs a satisfiable boolean formula. 

To any polynomial time clocked transducer M and any w G S* we shall 
assign the boolean formula TESTm,w such that: 

The formula TESTm.w is satisfiable if and only if M behaves well on input w. 

Our construction of the formula TESTm.w is adapted from Cook’s proof 
that SAT is NP-complete. Let S' be a fixed nondeterministic Turing machine 
working in polynomial time and accepting SAT . Let M' be the Turing machine 
which on any input x runs M and then runs S on the output produced by M . 
The formula TESTm.w is just Cook’s formula for the pair < M',w >. 

Test formulas possess the following properties: 

(1) Global uniformity property 

There exists a function / G FP such that for any polynomial time clocked 
transducer N with time bound + k and for any w G S* 

f{{N,w,Q\'^\"+^))=TESTN,n, 

(2) Local uniformity property 

Let M be any fixed polynomial time clocked transducer. There exists a 
function /m G FP such that for any w G E* 

fM{w) = TESTM,n, 



Definition 6. By an easy subset of SAT we mean a set B such that B C SAT 
and B G P. 

Let Di, £> 2 , 1?3, ... denote the standard enumeration of all polynomial time 
clocked deterministic Turing machines. 

Theorem 2. Statements (i) - (ii) are equivalent. 

(i) There exists a p-optimal proof system for SAT. 

(ii) The class of all easy subsets of SAT possesses a recursive P -presentation. 



304 



Zenon Sadowski 



By the statement (ii) we mean: there exists a recursively enumerable list 
of deterministic polynomial time clocked Turing machines Di^, Di^, ... such 
that 

(1) L{Di.) C SAT for every j 

(2) For every A C SAT such that AgP there exists j such that A = L{Di.) 

Proof, (i) — >■ (ii) The existence of a p-optimal proof system for SAT is equivalent 
to the existence of an almost optimal deterministic algorithm for SAT. Let M 
be such an algorithm. A recursive P-presentation of all easy subsets of SAT 
we will define in two steps. In the first step we define a recursively enumerable 
list of deterministic Turing machines F\, F2, F3,... The machine F^ is obtained 
by attaching the shut-off clock -I- A: to the machine M. On any input w, the 
machine Fk accepts w if and only if M accepts w in no more than + k steps, 
where n = |w|. The sequence F\, F2, F3, F4, ... of deterministic Turing machines 
possesses the properties (1) and (2): 

(1) For every i it holds L{Fi) C SAT 

(2) For every A which is an easy subset of SAT there exists j such that 
A c L{F,) 

To prove (2) let us consider A, an easy subset of SAT recognized by a Turing 
machine M" working in polynomial time. Combining this machine with the 
“brute force” algorithm for SAT we obtain the deterministic Turing machine 
M' recognizing SAT. From the definition of M' it follows that there exists a 
polynomial p such that for any a G A, TIME{M'-, a) < _p(|o;|). Since M is an 
almost optimal deterministic algorithm for SAT there exists a polynomial q such 
that for any a G A, TIME{M;a) < <z(|q^I)- From this we conclude that for a 
sufficiently large j, TIME{M] a) < |ap -I- j for any a G A, hence A C L{Ej). 

In the second step we define the new recursively enumerable list of deter- 
ministic polynomial time clocked Turing machines ATi, K2, K3,... We define AT„, 
n = {i, j), as the machine which simulates + i steps of Fi and + j steps of 
Dj and accepts w if and only if both Fi and Dj accept w and rejects w in the 
opposite case. 

Let A be any fixed easy subset of SAT. There exist k and m such that 
A = L{Dk) and A C L{Fm), hence A is recognized by the machine From 

this we conclude that iLi, K2, ATs, ... provides a recursive P-presentation of all 
easy subsets of SAT. 

{ii) -G (i) 

Let G be the machine generating the codes of the machines from the sequence 
Di^, Di^, Di^,... forming a recursive P-presentation of all easy subsets of SAT. 
We say that a string v G E* is in good form if 

V = {M, w, Comp - G, Comp - TESTm.w, oI“I'+'=) 



where: 

M is a polynomial time clocked Turing transducer with + k time bound, 
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w € E* , 

Comp — G is a computation of the machine G. This computation produces 

a code of a certain machine Di- , 

Comp—TESTM,w is a computation of the machine Di^ accepting the formula 
TESTm,w, 

0l™l is the sequence of zeros (padding). 

Let us note, that if v is in good form then TESTm,w is satisfiable as a 
formula accepted by a certain machine from P-presentation. This clearly forces 
M to behave well on input w, so M on input w produces a satisfiable boolean 
formula. 

Let «o be a certain fixed satisfiable boolean formula. We define 
Opt : S* — > E* in the following way: Opt{v) = a if w is in good form 

{v = {M, w, Comp - G, Comp - TESTm,w, oI“I"+'=)) 

and a is a satisfiable boolean formula produced by M on input w, otherwise 
Opt{v) = oo- 

Clearly, Opt : ^ SAT. 

In order to prove that Opt is polynomial time computable it is sufficient to 
notice that using global uniformity property we can check in polynomial time 
whether v is in good form. Hence Opt is a proof system for SAT . 

It remains to be proved that Opt p-simulates any proof system for SAT. Let 
/i be a proof system for SAT computed by a polynomial time clocked transducer 
K with time bound n} + 1. From the structure of Cook’s reduction (as TESTk.w 
clearly displays K and w) it follows that the set Ak= {TESTk.w- w G T'*} 
is an easy subset of SAT. Therefore there exists the machine Di^ from the P- 
presentation such that Ak = L{Di.). The function t : E* — E* translating 
proofs in h into proofs in Opt can be defined in the following way: 

t{x) = {K, X, Comp — G, Comp — TESTk,x, 

The word Comp — G in the definition of t is the computation of G producing 
the code of Dj,., Comp — TEST^^x is a computation of Di^ accepting TEST^.x- 

In order to prove that t is polynomial time computable let us notice that 
Comp — G is fixed and connected with K, so the only task is to prove that 
Comp — TESTk,x can be constructed in polynomial time. This follows from the 
local uniformity property of TESTk.x formulas and from the fact that Di^ is 
deterministic and works in polynomial time. 



5 Independence Resnlt 

Let T be any formal theory whose language contains the language of arithmetic, 
i. e. the language {0, 1, <, =, +, •}• We will not specify T in detail but only 
assume that T is sound (that is, in T we can prove only true theorems) and the 
set of all theorems of T is recursively enumerable. 
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The notation T \- (3 means that a first order formula [3 is provable in T. 

Let M be a deterministic Turing machine. 

Definition 7. We say that M is consistent if M accepts only satisfiable boolean 
formulas (if M accepts w, then w G SAT). 

By “L{M) C SAT^’ we denote the first order formula which expresses the 
consistency of M , i.e. yweL{M)[w is a satisfiable boolean formula]. 

Definition 8. A deterministic Turing machine M is T — consistent if and only 
ifTh “L{M) C SAT’. 



Definition 9. A set A C SAT is T-provably easy if there exists deterministic 
polynomial time clocked Turing machine M fulfilling (1) - (2) 

(1) M is T — consistent; 

(2) L(M)=A. 

Theorem 2 can be exploited to obtain the following independence result. 

Theorem 3. If there does not exist a p-optimal proof system for SAT, then for 
every theory T there exists an easy subset of SAT which is not T-provably easy. 

Proof. Suppose, on the contrary, that there exists a theory T such that all easy 
subsets of SAT are T-provably easy. Then the following recursively enumerable 
set of machines f2j'={M\ M is a deterministic polynomial time clocked Turing 
machine which is T — consistent} creates a recursive P-presentation of the class 
of all easy subsets of SAT. By Theorem 2, this implies that there exists a p- 
optimal proof system for SAT, giving a contradiction. 
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Abstract. DOL systems consist of iterated letter-to-string morphism 
over a finite alphabet. Their descriptional power is rather limited and 
their length sequences can be expressed as sum of products of polynomial 
and exponential functions. There were several attempts to enrich their 
structure by various types of regulation, see e.g. J], leading to more 
powerful mechanisms. 

Due to increasing interest in biocomputing models, V. Mihalache and 
A. Salomaa suggested in 1997 Watson-Crick DOL systems with so called 
Watson-Crick morphism. This letter-to-letter morphism maps a letter to 
the complementary letter, similarly as a nucleotide is joined with the 
complementary one during transfer of genetic information. Moreover, 
this new morphism is triggered by a simple condition (called trigger), 
e.g. with majority of pyrimidines over purines in a string. 

This paper deals with the expressive power of standard Watson-Crick 
DOL systems. A rather unexpected result is obtained: any Turing com- 
putable function can be computed by a Watson-Crick DOL system. 



1 Watson-Crick DOL Schemes 

For elements of formal language theory we refer to jTITT!) . Here we only briefly 
fix some notation. For a finite alphabet S, denote (if, •) a free monoid with the 
catenation operation and the empty word A. For a € S, w G S*, |zc|a is the 
number of occurrences of a in w. For F C E, jrch = Saer I^U- For w G E* we 
denote tc" the catenation of n copies of tc for n > 1. 

A concept of Watson-Crick alphabet is a straightforward generalization of 
notions of “natural” DNA alphabet consisting of symbols {A,C,G,T}. 

A DNA-like alphabet E is an alphabet with an even cardinality 2n, n > 1, 
where the letters are enumerated as follows: 

E — , . . . , Qn , Ui , . . . , On} ■ 

We say that and of are complementary letters. The letter to letter endomor- 
phism hw of E* mapping each letter to the complementary letter is called the 
Watson-Crick morphism. Hence 

hw{ai) = of, hu,(of) = Oi, I < i < n. 

M. Margenstern and Y. Rogozhin (Eds.): MCU 2001, LNCS 2055, pp. 308-|^l^ 2001. 
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In analogy with the DNA alphabet we call the non-barred letters purines and 
the barred letters pyrimidines. The subset of S* consisting of all words, where 
the number of occurrences of pyrimidines is strictly greater than that of purines 
is denoted by PYR. The complement of PYR is denoted by PVR. We further 
denote the sets ^ ^ a set Z\ C S, 

we denote hw{^) = {hw{a) \ a S A}. 

Definition 1. A standard Watson-Crick DOL scheme (or shortly scheme^ is a 
eonstruet G = (T',p), where S = {ai, . . . , a„, ol, • ■ ■ , On}, P ■ — > AJ* is a 

morphism. Given a word wq € S* , the derivation sequence S{G,wo) defined by 
G from wq consists of the words Wq, Wi, W 2 , • ■ • , where for i > 0, 

^ J p{wi), ifp{wi) G PUR 
\hwip{wi)), ifp{wi)GPYR. 



The transition Wi =>g R’i-i-i is also called the derivation step of G. If Wi+i = 
hy/{p{wi)), then we speak about complementation derivation step. We denote 
the transitive and reflexive closure of =^g as usual. 

For more details and motivation underlying the concept of Watson-Crick DOL 
systems see |4Iil8l9j (where a Watson-Crick DOL system differs from scheme only 
by adding an axiom wq G PUR). In 0 and others, a Watson-Crick DOL system 
is viewed also as a DTOL system with two morphisms p and together with a 
regulation mechanism guiding the selection of the morphism. Contrary to DTOL 
system, the regulation mechanism gives rise to determinism and the system 
generates a unique sequence of words. Another important difference is in fact 
that the length of the derivation can be determined by the triggering mechanism, 
as the further example shows. This is the main principle of the below proofs 
showing universality of Watson-Crick DOL schemes. 



2 Partial Recursive Functions 

In this section we briefly resume a widely-used characterization of partial re- 
cursive functions in a machine-independent way. For more details and also for 
famous Kleene’s theorem, claiming the existence of an universal function, we 
refer to nm. 

The symbol N is used for the set of all nonnegative integers. We denote 
(partial) recursive functions the class of functions / : N* — >■ N for some t > 
0, computable by Turing machines. The word “partial” can be omitted if the 
domain of / is N*. The below notation can be easily extended to functions 
N‘ — )■ N®, s > 0. 

Definition 2. The family of primitive recursive functions is the smallest family 
of integer-to-integer functions with the following properties: 

(i) R contains the following base functions: 

0 (nullary constant), 

S{x) = X + 1 (successor function), 

U0‘(xi , . . . , Xn) = Xi (projection functions), for 1 < i < n. 
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(a) It is closed under the following operations: 

• composition: if h : N™ — >■ N, gi ■ N" — >■ N, • • • , : N" — >■ N 

are primitive recursive functions, then so is the function f : N" — > N 
defined as follows: 

f{xi, ...,Xn) = h{gi{xi , . . . , x„), . . . , gm{xi, Xn))- (1) 

• primitive recursion: if h : N" — > N, g '■ — > N are primitive 

recursive functions, then so is the function f : — > N defined as 

follows: 

f{0,Xi,...,Xn)=h{xi,...,Xn), . 2 '> 

f{z+l,Xi,..., Xn) = g{z, f{z, Xi,...,Xn),Xi,...,Xn), Z > 0. 

Theorem 1. The family of partial recursive functions is the smallest family of 
integer-to-integer functions with the following properties: 

(i) It eontains the nullary constant, the successor function and the projection 
functions. 

(a) It is closed under the operations composition, primitive recursion and min- 
imalization, defined as follows: 

If h : — > N ts a partial recursive function, then so is the function 

f : N" — >■ N, where 

f{xi, ...,Xn)= min{?/ € N\h{xi, ...,Xn,y) = 0}, (3) 

and h{xi, . . . ,Xn, z) is defined for all integers z, Q < z < y. Otherwise, 
f{xi, . . . , Xn) is undefined. 

3 Computation by Watson-Crick DOL Schemes 

In this section we define a representation of functions suitable for computing 
with Watson-Crick DOL schemes. Our approach is different from DOL growth 
functions which are frequently used in the cited literature. It is rather similar 
to the representation used with Turing machine. A substantial difference is the 
usage of different symbols for each function argument, since the Parikh vector 
of a DOL sequence does not depend on the order of symbols in an axiom. 

In the rest of the paper, we represent an n-tuple of nonnegative numbers 
by the Parikh vector of a string of symbols. Both this representation and the 
derivation in a Watson-Crick DOL scheme is independent on the order of symbols 
in a string. The following definitions utilize this fact to simplify notation. 

Definition 3. Let G = {S,p) he a Watson-Crick DOL scheme. For x,y G S* 
we write 

x^GV iff X =^G y', 

where y' is an arbitrary symbol permutation of y. We denote ^^6 transitive 

and reflexive closure of =^g usual. 
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Definition 4. Let f : N” — >■ N™ be a (partial) funetion and let G = {LJ,p) be a 
Watson-Crick DOL scheme. We say that G computes f if contains subsets 

r = {$, , . . . , ^[„]} and A = , ■ • ■ , -^[m]} such that the following holds: 

- fix ,, . . . , x„) = {yi, • . • , 2/m) iff $4) . . . . . . Zf™], 

- p{X) = X for X€ A, 

- either Iwj/i = 0 or w € A* for each w € S{G, i.e. no symbol 

from A appears in the derivation sequence until the output string is derived. 

The sets D and A are called the input set and the output set, respectively. We 
can assume without loss of generality that different schemes can have disjoint 
alphabets, simply by renaming the symbols of D and A. 

It immediately follows from the above definition that a scheme comput- 
ing a partial function / : N” — > N™ never produces any output symbol if 
f{xi, . . . ,Xn) is undefined for a certain n-tuple x\, ... ,Xn. It also follows that 
the result of computation is independent on order of symbols in the input string. 

Example 1. The following Watson-Crick DOL scheme G = {E,p) computes the 
function /(n) = [log 3 n] , where [x] is the ceiling of x. 

r = {$, A[1] , B, G, D, S, Z[,J ,#,$,AiO,B, G, D, S, #}, 

p($)=^ p{S)=# p{B) = \ p(S) = SGD 

p{G) = X p(A[i])=A p{B) = BBB p(D) = Z[,] 

and p{X) = X for all other X G E. At the first step, the symbols S and B 
are generated, while ^[i]’s representing the value of n remain unchanged. Then 
at each step the number of B’s is triplicated, and a pair GD is added. When 
the number of B's reaches the number of ’s, the complementary transition 
occurs, resulting in rewritting D to the output symbol For example, 

$yl9 j ^ ^ SiGDfWA\^^ 

SiCDfB^A([^^ #4, 

Notice again that contrary to DTOL system, the number of output symbols 
is nonlinear with respect to the number of input symbols, which is allowed by 
the complementarity triggering mechanism. 

4 Useful Properties of Watson-Crick DOL Schemes 

In this section we study ability of Watson-Crick DOL schemes to compose more 
complex functions from simpler ones. It is shown that certain operations over 
functions, even simpler than those in Section|2l can be realized by Watson-Crick 
DOL schemes. These results are of key importance for the next section. 

Definition 5. Let G = {E,p) be a Watson-Crick DOL scheme with an input set 
r and an output set A. G is said to be fc-lazy if for each axiom wq G E* and 
<7 > 0, p{wq) G PYR implies p{wq+i) G PUR and Wq+i ^ A* , 1 <i <k. 
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Informally, each complementation step is followed by at least k non-comple- 
mentation steps during which G does not produce an output. This property will 
turn out important for merging several schemes into one, as following lemmata 
do. It follows from the above definition that if G never performs a complemen- 
tation step during a derivation of S{G,wq), then G is /c-lazy for any fc > 0. 

Definition 6. A (partial) function f is said to be W^-computable, if there is a 
k-lazy Watson-Crick DOL scheme computing f. f is called to be W-computable, 
if it is Wk computable for some k > 0. 



Lemma 1. Let f : N” — ^ N™ be a Wi~ computable function, i >Q. Let £ > n, 
k < m, let P and Q be arbitrary permutations over the sets {!,...,£} and 
{1, . . . , m}, respectively. Then the following funetion g : — > N* is Wi~ 

computable: 



9{^P{1) > ■ ■ ■ ) — (J/Q(1) ) • ■ • 5 UQ(k) ) f (AIt ■ ■ ■ 1 ^n) — iui T ■ ■ ■ ^ ym) ■ 

Proof. Denote G = {LJ,p) a scheme computing /, with the input set P = 
{$, . . . , A[„]} and the output set A = . . . , Consider the 

scheme G' = {S' ,p') with the input set P' = {$', , . . . , A'^j} and the out- 
put set A' = {#', . . . , where E' = SU P' U A' U hw{r' U A'), 



p'(S') = $ 



p'm = #' 






£<j<n, 
n<j<£, 
l<j<k, 
k < j < m, 



p'{X) = p{X) for X G {X — A) and p'{X) = X for all other X G S' . Then 
obviously G' computes g. 



Lemma 2. Let functions f : N" — ^ N"* and g : N™ — >■ be Wi and Wj- 

computable, respectively. Then the function h : N" — ^ N^, h(x\, . . . ,Xn) = g ° 
f{xi , . . . , Xn), is Wk~ computable, where k = min(z, j) and gof is the composition 
of the functions f and g. 

Proof. We need only to rewrite output symbols of the scheme computing / to 
input symbols of the scheme computing g. We refer to H21 for details. 

The following two lemmata show that it is possible to “call” one function 
from another via Watson-Crick DOL schemes, having saved an actual state of 
computation of the calling function. 

Lemma 3. Let f : N” — > N be a function computed by a k-lazy Watson-Crick 
DOL scheme F, k >1, with an input set P = {$, A[i], . . . and an output set 

A = {ff,Z}. Then there is a k-lazy Watson-Crick DOL scheme Gi = (Si,pi) 
such that y = f{xi , . . . , a;„) ijf 

where -^[1]’ ■ ■ ■ ’ B[n],C[i], ..., C[n] G Si. 
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Proof. Denote F = (E,p), and let 

^PUR ^ ^PUR u ^ ^ ^ \l<J<n, l<i<k}, 

the sets on the the right hand side being disjoint. Let Ei = U hw{EY^^). 

Let Pi{X) = p{X) for each X G E. Consider an arbitrary input string 
Wq = and denote Wq, Wi,W 2 , ■ ■ ■ members of the sequence S{G, Wq). 

Below we define elements of pi and simultaneously show their effect on the 
derived string. 

Permutation of derived strings Elements of pi 



I < j < n 



Pl{By]) = By], Pi{Cy]) = Cy] 
I < j < n 



a={By]Cy]rE..{B]^]C]^]r-. (4) 

Consider now that p{wr) G PYR for some r > 0 and notice that pi(wra) G PYR 
'i'ff p{wr) G PYR. Then Gi performs a complementation step and its behavior 
will be the following: 

Permutation of derived strings Elements of pi 



Wo{B]i]C]i]Y^ . . . {B]n]G]n]Y'‘ 

wYB]^^[]YY..{B]^]^Y]Y'^ 

Denote for brevity 



WrCi 

Wr+ih{a) 

Wr+2{B]i]G]i]iY^ ■ . ■ (R[„]C'[„]i)“^ 



Pi (a) = a, see above 



Pl{By]) = A, Pl{Cy]) = B]j)Gyy 
1 < j <n 



Pi{Cy\i) = Cy]i+i 

l<z<A; 






( 5 ) 
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II. Pi((^[j]k) = C[j] 

l<j<n 

Wr+k+2 a for p{wr+k+i) G PUR, or 

Wr+k+2 Ha) for p{wr+k+i) G PYR. 



These strings contain the same symbols as the previously derived Wra or 
Wr+iHHi respectively, and the derivation will continue as above. The above 
derivation holds since F is fc-lazy by assumption and hence p{wr+i) G PUR, 
1 < i < k. All yet undefined elements of pi adopt the form pi{X) = X, X & Ei. 

Now consider that in a g-th step, g > 1, the string WqP was derived by Gi, 
where Wq = G A*, for some P G Sf. Then either WqP adopts the form o, 
if complementation occurred in {q — k)-th step, or /3 = a, if complementation did 
not occur in the last k steps since F is A:-lazy. In both cases a 

since #Z^P G PUR, and hence a permutation of the string ^Z^a is produced 
by Gi as the lemma claimed. 



Lemma 4. Let f : M" — > be a Wk eomputahle funetion, k > 1. Then the 
function g : N” — g(xi , . . . ,x„) = (xi, . . . ,x„,/(xi, . . . ,Xn)), is Wk-i~ 
computable. 

Proof. Proof of Lemma 01 showed that there is a scheme G\ such that y = 
/(xi, . . . , x„) iff . . . Af^^^Q^ffZy P, where #, Z did not appear in pre- 

vious members of the sequence, and moreover 

pMzyp) = ffzya = ffzy{B[,pH]T" • • • 

with a, /3 as in 0, 0. Nevertheless, the symbols 1 < j < n, could 

appear in previous members of the sequence. Now it remains to construct a 
scheme G 2 which rewrites all the symbols of ffZyp to its output symbols which 
did not appear in the sequence yet. 

Let G 2 = (A’ 2 ,P 2 ): denote and A 2 the input and the output set of G 2 , 
respectively. Let T 2 = A, L \2 = {#', . . . , Let 

^ y;f™Uzi 2 U{A, A | 0 < ■« < 2fc-l}U{G[j], \ k<i< 2k-l, 1 < j < n}, 
the sets on the the right hand side being mutually disjoint. Let A = U 

Let P 2 {X) = pi{X) for each X G Ei — A — {G[j]k | 1 < j < n}- Hence the 
derivation of G 2 is identical to that of Gi until the string ffZyp is produced, 
since the symbols Gj^j^ never appeared during this derivation. 

Having the string ffZyp, G 2 rewrites the symbols ff, Z to new symbols and 
performs a complementation step to rewrite P to h(a) (see the above proof for 
elements of pi). Then k — 1 non-complementation steps follow to keep the scheme 
(fc — l)-lazy. 
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Permutation of derived strings Elements of p2 



P2{#) = So, P2{Z) = Do 



SkDi(B\^]C[,^kr^ . . . 

In the last step another complementation occurred, rewriting the symbols 
C[j]fe in an other way than Gi would do. Notice that this complementation occurs 
only when the symbols Sk-i, appear in the sequence, which is possible only 
when a computation of j/ = f{xi , . . . , x„) is finished. Until then, the derivation 
was fully controlled by elements of pi . 

Permutation of derived strings Elements of p2 



P2{Si) = Si+i, P2{Di) = Di+i 

Q < i < k — 1 



P2{Sk-i) = Sk, P2{Dk-i) = Dk 



#zyp 

SoDlh{a) 






S2k-iDi^.,q^^^k-i---C\ 



[n]2k-l 






nr ry!x\ rvIXn ryty 
# ^[1] • • • ^[n+l] 



P2{Si) = Si+I, P2{Di) = A+1 
P2{C[j]i) = G[j]i+1 

k < i < 2 k — I 



P2{S2k-l) = #', 
P 2 {D 2 k-l) = 

P2{C[j]2k-l) = 1 < J < n 



The last k — 1 non-complementation steps was again to keep the scheme 
(A: — l)-lazy. All yet undefined elements ofp2 adopt the formp2(AT) = X, X G A2. 
It follows immediately from the above description that G2 is {k — l)-lazy and 
computes g, since the symbols from A2 did not appear in the derivation sequence 
until the output string was produced. 



Note. Consider that in the above lemma the function / is W^-computable for 
any fc > 0, and hence the scheme computing / never performs a complementation 
step. On the contrary, the resulting scheme G2 computing g does perform a 
complementation. Hence an arbitrary but fixed lazy value k must be chosen, 
according to which the construction of G2 is done, and the function g is then 
Wfc_i computable. 
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For the next lemma we fix the following notation: let / : N" — > N” be a 
(partial) function. Denoting o the composition operation, define 

/°(xi, ...,Xn) = {xi,. . .,Xn), 

f'^{xi,...,Xn) = fof'^-^{xi,...,Xn), k > 0. 



Lemma 5. Let f : N" — > N", n > 2, be a Wu-computable function, k > 1. 
Then so is the function g : N" — > N”, defined as 

g{xi, ...,Xn)= f{xi, . . .,Xn), 

i = min{£ | f{xi, . . . ,Xn) = {yi, ■ ■ ■ , Vn), Vi > 2/2}, 
if such an i exists, otherwise g{xi , . . . , Xn) is undefined. 

Proof. The principle is in computation of (jji, . . . , yn) = f{xi, . . . , Xn) and re- 
writting output symbols to the input ones, forming a loop. When the condition 
yi > J/2 holds, the complementation occurs, breaking the loop. We refer to ini 
for details. 

Notice that if no such i exists for a certain n-tuple x\, . . . ,Xn, then g{xi , . . . , Xn) 
is undefined and the resulting scheme computing g never produces an output 
symbol according to Definition 0 

5 Main Result 

We show that any partial recursive function can be computed by a Watson-Crick 
DOL scheme. The proof is based on the fact that the base functions and opera- 
tions in Theorem n can be realized by Watson-Crick DOL schemes. Technically, 
these operations can be decomposed into simpler parts, which were shown to be 
W-computable in the previous section. 

Lemma 6. The following functions are Wk~ computable for any k > 0.' (i) the 
nullary constant 0, (ii) the successor function S{x), (Hi) the projection function 
Uff{xi,...,Xn). 

Proof (i) Consider the scheme G = {{$, A,ff, Z,$, A,ff, Z},p) such that 
p($) = #, p{A) = X and p{X) = X for all other X G X. 

(ii) Consider the scheme G = {{$,# A,Z,$,ff,A,Z},p) such that p{$) = ffZ, 
p{A) = Z and p{X) = X for all other X G X. 

(iii) Consider the scheme G = ({$, # A[i] ,..., A [„] , Z, $, #, A[i] ,..., A^^] ,Z},p, $) 
such that p{%) = #, p(A[j]) = Z, p{Ay^) = A for j yf i and p{X) = X for all 
other X G X. 



Lemma 7. Let h : N™ — >■ N, gi ■ N” — ^ N, ■ • • , : N" — ^ N be Wi~ 

computable functions, i > 1. Then the function f : N” — > N, f{xi, . . . ,Xn) = 
h{gi{xi, . . . ,Xn), . . . ,gm{xi, . . . ,Xn)), is Wi-i~ Computable. 
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Proof. We can derive according to previous lemmata that the following functions 
are Wi_i-computable: 

- /i : N” — defined as /i(a;i, ...,Xn) = {xi, . . .,Xn,yi), where yi = 
gi{xi, . . . , Xn), due to Lemma 0 ] 

- f2{xi, ...,Xn) = {xi,. . .,Xn,yi,y2), where 1/2 = g2{xi, . . .,Xn), since 

• 52(^11 ■ • ■ ^Xn,yi) = g2(xi, . . . ,Xn) is Wfe-computable due to Lemma IH 

• 52(3^1, ■ • ■,Xn,yi) = (xi , . . . ,a;„, 1/1,32 (a^i, ■ ■ -,x„,yi)) is Wfc_i-compu- 
table due to Lemma 0 and 

• f2{xi , . . . , Xn) = g2°fi{xi, . . . , Xn) is Wfc_i-computable due to LemmaEl 

- f 3 {xi,...,Xn) = {xi,. ■ . ,Xn,yi,y 2 ,y 3 ), where 2/3 = g 3 {xi,...,Xn), due to 
Lemmata n El and El as in the construction of /2 above; 

- fm{xi, ...,Xn) = (xi, . . -,Xn,yi, ■ ■ ■ , 2 /m), where ym = gra{xi, . . .,Xn), ana- 
logously as in the construction of /2 above; 

- /m-i-i(a:i, . . . ,a;„) = (xi, . . . , ?/i, ■ ■ ■ , 2 /m, /i( 2 /i, • ■ ■ , 2 /m)) as in the con- 
struction of /2 above; 

- /(xi, . . . ,x„) = o fm,+i{xi, . .. ,Xn) due to Lemmata Hand El (iii). 

We constructed the function / using the constructions of the Lemmata Q El El 
and El in a straightforward manner, hence / is Wi_i-computable function. 

Lemma 8 . Let h : N" — /■ N, g ■ — >■ N he Wi-eomputable funetions, 

i > 1 . Then the funetion f : — > N defined as in is Wi-i-eomputable. 

Proof. The following functions are Wi_i-computable: 

- S'{y, z,w,xi, . . . ,Xn) = S{y) due to Lemmata ^ and El (h); 

- /i : def. as fi{y, z,w,xi, . . . ,Xn) = {y,z,w,xi ,.. . ,x„,2/i), 

where 1/1 = S'{y, z,w,Xi, . . . , x„), due to Lemma calling 2 ; 

- f2{y,z,w,xi,...,Xn) = {y,z,w,xi,...,Xn,yi,y2), where 

2/2 = g(y, w, Xi, . . . , Xn), analogously to the construction of /2 in the above 
proof; 

- h{y,z,w,Xi , . . . ,x„) = (2/1, z, 2/2, 2 ^ 1 , ■■■,Xn) due to LemmalU 

- f4{y, z,w,xi, . . . , Xn) = /a (2/, z,w,x\, . . . , x„), where k > 0 is minimal such 
that y > z, due to Lemma 0 

Consider y = 0 , then the calculation of f4{y, z,w, xi, . . . , x„) will proceed as 
follows: w := g{y,w,xi, . . . ,x„), y := S{y), repeatedly while y < z, i.e. z-times. 
If we at the beginning assume w = h{x \, . . . , x„), then value of w is computed 
exactly according to m in Definition El of the primitive recursion. 

Notice that if g and h are defined for all values of arguments, then so is /i 
since the computation according to Lemma 0 finishes after z cycles. It remains 
to show that the following functions are Wi_i-computable, establishing w as the 
value computed by the function / : 

- hi{z,X\, . . . , Xn) = h{x \, . . . , Xn) due to Lemma 0 
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— h 2 {z, Si, . . . , Xn) = {z, xi , . . . , Xn, hi{z, Xi , . . . , Xn)) du6 to Lemma El 

— h^{z,xi, . . . ,Xn) = {z,Xi, . . . ,Xn,hi{z,xi, . . . ,Xn),0) analogously to the 
construction of /2 in the above proof, having 0 the nullary function; 

— ft. 4 (z, x \, . . . , Xn) = (0, z, h(xi, . . . , Xn),xi, . . . , Xn) duc to Lemma^ 

— /s(z, cci, . . . , Xn) = /4 o hi{z, xi, . . . , Xn) duc to Lemma|2 

— f{z, xi, . . . , Xn) = o /s(z, xi, . . . , Xn) duc to Lemmata El and El (iii). 

Lemma 9. Let h : — > N &e a Wi~ computable function, i >1. Then the 

function f : N” — )■ N, f{xi, . . . ,Xn) = min{?/ e N , a;„, y) = 0} is 

Wi-i-computable. 

Proof. The following functions are Wi_i-computable: 

— fi{v,w,xi,...,Xn,y,z) = {v,h{xi,. . . ,Xn,y),Xi,. . . ,Xn,S{y),y), analogo- 
usly to the construction of fi in the proof of Lemma El 

— f 2 {v,w,Xi, ...,Xn,y,z) = ff{v,w,Xi, ... ,Xn,y, z) , where A: > 0 is minimal 
such that V > w, due to Lemma El 

Consider that we compute /2 with v = y = z = 0,w=l, then its values will 
be calculated as follows: w := h{xi , . . . , a:„, y),y := S{y), z := y, repeatedly until 
h{xi, . . . , Xn, y) = 0. Taking as function value the value of y in the previous step 
(which is stored in 2 at that moment), we obtain exactly the value in 0 where 
the minimalization is defined. 

Notice that if no y G N exists such that h{xi , . . . , Xn, y) = 0, or h{xi , . . . , Xn, z) is 
undefined for some z < y, then /(xi, . . . , Xn) is undefined and the computation 
of / according to Lemma 0 never produces an output symbol. 

It remains to show that the following functions are Wi_i-computable: 

“ fsixi, . . . , Xn) = /2(0, 1, Xi, . . . , Xn, 0, 0), due to Lemmata [0 0 0 and 0 
(i),(ii), analogously to the construction of hi, . . . ,hi and f^ in the above 
proof; 

— /(xi, . . . , Xn) = o fs{xi, . . . , Xn) due to Lemmata0 and 0 (iii). 

Theorem 2. Any (partial) recursive function is W-computable. 

Proof. Follows from TheoremQand from Lemmata0-0 Notice that a necessary 
condition for Lemmata0-0is that all the functions that are subject to these 
operations must be W^-computable, i > 1. Taking into account that the result of 
each elementary operation is then Wi_i computable, we can ask what happens 
when the value 0 is reached. 

Fortunately, when constructing any (partial) recursive function, we always 
start with base functions which are computable for each i, and then we apply 
only a fixed number of the operations. This number is of course independent on 
the function arguments. During the first application of these operation over the 
base functions, a construction according to Lemma0 is applied and an arbitrary 
but fixed lazy value must be chosen (see also note following Lemma 0). Hence it 
is enough to choose this value larger than the number of the operations further 
applied, so that the lazy value 0 is never reached. 

Corollary 1. The universal function is W-computable. 
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6 Conclusions 

We studied the properties of Watson-Crick complementation in the framework 
of deterministic Lindenmayer systems. We obtained an universal computational 
power only by enhancing the DOL iterated morphism by a complementation 
mechanism, which is one of the basic DNA operations. This unexpected result 
may be inspiring for both biological and computational research in molecular 
computing. There may be also further consequences of the result in the the- 
ory of formal languages and automata. Notice, however, that for more complex 
functions the number of necessary symbols in the resulting Watson-Crick DOL 
schemes can be enormously large. 
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